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Preface 


Differential geometry has a long and glorious history. As its name implies, it is the 
study of geometry using differential calculus, and as such, it dates back to Newton 
and Leibniz in the seventeenth century. But it was not until the nineteenth century, 
with the work of Gauss on surfaces and Riemann on the curvature tensor, that dif- 
ferential geometry flourished and its modern foundation was laid. Over the past one 
hundred years, differential geometry has proven indispensable to an understanding 
of the physical world, in Einstein’s general theory of relativity, in the theory of gravi- 
tation, in gauge theory, and now in string theory. Differential geometry is also useful 
in topology, several complex variables, algebraic geometry, complex manifolds, and 
dynamical systems, among other fields. It has even found applications to group the- 
ory as in Gromov’s work and to probability theory as in Diaconis’s work. It is not 
too far-fetched to argue that differential geometry should be in every mathematician’s 
arsenal. 

The basic objects in differential geometry are manifolds endowed with a metric, 
which is essentially a way of measuring the length of vectors. A metric gives rise 
to notions of distance, angle, area, volume, curvature, straightness, and geodesics. 
It is the presence of a metric that distinguishes geometry from topology. However, 
another concept that might contest the primacy of a metric in differential geometry 
is that of a connection. A connection in a vector bundle may be thought of as a 
way of differentiating sections of the vector bundle. A metric determines a unique 
connection called a Riemannian connection with certain desirable properties. While 
a connection is not as intuitive as a metric, it already gives rise to curvature and 
geodesics. With this, the connection can also lay claim to be a fundamental notion 
of differential geometry. 

Indeed, in 1989, the great geometer S. S. Chern wrote as the editor of a volume 
on global differential geometry [5], “The Editor is convinced that the notion of a 
connection in a vector bundle will soon find its way into a class on advanced calculus, 
as it is a fundamental notion and its applications are wide-spread.” 

In 1977, the Nobel Prize-winning physicist C. N. Yang wrote in [23], “Gauge 
fields are deeply related to some profoundly beautiful ideas of contemporary 
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mathematics, ideas that are the driving forces of part of the mathematics of the last 
40 years, ..., the theory of fiber bundles.” Convinced that gauge fields are related to 
connections on fiber bundles, he tried to learn the fiber-bundle theory from several 
mathematical classics on the subject, but “learned nothing. The language of modern 
mathematics is too cold and abstract for a physicist” [24, p. 73]. 

While the definition and formal properties of a connection on a principal bundle 
can be given in a few pages, it is difficult to understand its meaning without knowing 
how it came into being. The present book is an introduction to differential geometry 
that follows the historical development of the concepts of connection and curva- 
ture, with the goal of explaining the Chern—Weil theory of characteristic classes on 
a principal bundle. The goal, once fixed, dictates the choice of topics. Starting with 
directional derivatives in a Euclidean space, we introduce and successively general- 
ize connections and curvature from a tangent bundle to a vector bundle and finally to 
a principal bundle. Along the way, the narrative provides a panorama of some of the 
high points in the history of differential geometry, for example, Gauss’ Theorema 
Egregium and the Gauss—Bonnet theorem. 

Initially, the prerequisites are minimal; a passing acquaintance with manifolds 
suffices. Starting with Section 11, it becomes necessary to understand and be able to 
manipulate differential forms. Beyond Section 22, a knowledge of de Rham coho- 
mology is required. All of this is contained in my book An Introduction to Manifolds 
[21] and can be learned in one semester. It is my fervent hope that the present book 
will be accessible to physicists as well as mathematicians. For the benefit of the 
reader and to establish common notations, we recall in Appendix A the basics of 
manifold theory. In an attempt to make the exposition more self-contained, I have 
also included sections on algebraic constructions such as the tensor product and the 
exterior power. 

In two decades of teaching from this manuscript, I have generally been able to 
cover the first twenty-five sections in one semester, assuming a one-semester course 
on manifolds as the prerequisite. By judiciously leaving some of the sections as 
independent reading material, for example, Sections 9, 15, and 26, I have been able 
to cover the first thirty sections in one semester. 

Every book reflects the biases and interests of its author. This book is no excep- 
tion. For a different perspective, the reader may find it profitable to consult other 
books. After having read this one, it should be easier to read the others. There are 
many good books on differential geometry, each with its particular emphasis. Some 
of the ones I have liked include Boothby [1], Conlon [6], do Carmo [7], Kobayashi 
and Nomizu [12], Lee [14], Millman and Parker [16], Spivak [19], and Taubes [20]. 
For applications to physics, see Frankel [9]. 

As a student, I attended many lectures of Phillip A. Griffiths and Raoul Bott 
on algebraic and differential geometry. It is a pleasure to acknowledge their influ- 
ence. I want to thank Andreas Arvanitoyeorgos, Jeffrey D. Carlson, Benoit Charbon- 
neau, Hanci Chi, Brendan Foley, George Leger, Shibo Liu, Ishan Mata, Steven Scott, 
and Huaiyu Zhang for their careful proofreading, useful comments, and errata lists. 
Jeffrey D. Carlson in particular should be singled out for the many excellent pieces 
of advice he has given me over the years. I also want to thank Bruce Boghosian for 
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helping me with Mathematica and for preparing the figure of the Frenet—Serret frame 
(Figure 2.5). Finally, I am grateful to the Max Planck Institute for Mathematics in 
Bonn, National Taiwan University, and the National Center for Theoretical Sciences 
in Taipei for hosting me during the preparation of this manuscript. 


Medford, MA, USA Loring W. Tu 
April 2017 
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Chapter 1 


Curvature and Vector Fields 


By a manifold, we will always mean a smooth manifold. To understand this book, it 
is helpful to have had some prior exposure to the theory of manifolds. The reference 
[21] contains all the background needed. For the benefit of the reader, we review in 
Appendix A, mostly without proofs, some of the definitions and basic properties of 
manifolds. 

Appendix A concerns smooth maps, differentials, 
vector fields, and differential forms on a manifold. 
These are part of the differential topology of mani- 
folds. The focus of this book is instead on the dif- 
ferential geometry of manifolds. Now a manifold 
will be endowed with an additional structure called 
a Riemannian metric, which gives a way of mea- 
suring length. In differential geometry, the notions 
of length, distance, angles, area, and volume make 
sense, whereas in differential topology, since a mani- 
fold can be stretched and still be diffeomorphic to the 
original, these concepts obviously do not make sense. 

Some of the central problems in differential ge- Bernhard Riemann 
ometry originate in everyday life. Consider the prob- 
lem in cartography of representing the surface of the 
earth on a flat piece of paper. A good map should 
show accurately distances between any two points. Experience suggests that this is 
not possible on a large scale. We are all familiar with the Mercator projection which 
vastly distorts countries near the north and south poles. On a small scale there are 
fairly good maps, but are they merely approximations or can there be truly accurate 
maps in a mathematical sense? In other words, is there a distance-preserving bijec- 
tion from an open subset of the sphere to some open subset of the plane? Such a map 
is an isometry. 

Isometry is also related to a problem in industrial design. Certain shapes such as 
circular cylinders and cones are easy to manufacture because they can be obtained 
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from a flat sheet by bending. If we take a sheet of paper and bend it in various ways, 
we obtain infinitely many surfaces in space, and yet none of them appear to be a 
portion of a sphere or an ellipsoid. Which shapes can be obtained from one another 
by bending? 

In 1827 Carl Friedrich Gauss laid the foundation for the differential geometry 
of surfaces in his work Disquisitiones generales circa superficies curvas (General 
investigation of curved surfaces). One of his great achievements was the proof of 
the invariance of Gaussian curvature under distance-preserving maps. This result 
is known as Gauss’s Theorema Egregium, which means “remarkable theorem” in 
Latin. By the Theorema Egregium, one can use the Gaussian curvature to distin- 
guish non-isometric surfaces. In the first eight sections of this book, our goal is to 
introduce enough basic constructions of differential geometry to prove the Theorema 
Egregium. 
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A Riemannian metric is essentially a smoothly varying inner product on the tangent 
space at each point of a manifold. In this section we recall some generalities about 
an inner product on a vector space and by means of a partition of unity argument, 
prove the existence of a Riemannian metric on any manifold. 


1.1 Inner Products on a Vector Space 


A point u in R? will denote either an ordered triple (u!,u*,u°*) of real numbers or a 
column vector 


In terms of this, one can define the length of a vector 


IIlvll= Vv), (1.1) 


the angle @ between two nonzero vectors (Figure 1.1) 


(u,v) 


cos 90 = 
NeaiA al 


, O<OK<zq, (1.2) 


and the arc length of a parametrized curve c(t) in R?,a<t <b: 


b 
7 Ile! (t) ||. 
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v 


Fig. 1.1. The angle between two vectors. 


Definition 1.1. An inner product on a real vector space V is a positive-definite, 
symmetric, bilinear form ( , ): Vx V +R. This means that for u,v,w € V and 
a,beER, 


(i) (positive-definiteness) (v,v) > 0; the equality holds if and only if v = 0. 
(ii) (symmetry) (u,v) = (vu). 
(iii) (bilinearity) (au + bv,w) = a(u,w) + b(v,w). 


As stated, condition (iii) is linearity in only the first argument. However, by the 
symmetry property (ii), condition (iii) implies linearity in the second argument as 
well. 


Proposition 1.2 (Restriction of an inner product to a subspace). Let ( , ) be an 
inner product on a vector space V. If W is a subspace of V, then the restriction 


(, w:=(,)|lwxw: WxwoR 


is an inner product on W. 


Proof. Problem 1.3. 


Proposition 1.3 (Nonnegative linear combination of inner products). Let ( , );, 
i=1,...,7, be inner products on a vector V and let a\,...,a, be nonnegative real 
numbers with at least one a; > 0. Then the linear combination ( , ) := Yaj(, )i is 
again an inner product on V. 


Proof. Problem 1.4. 


1.2 Representations of Inner Products by Symmetric Matrices 


Let e),...,é@, be a basis for a vector space V. Relative to this basis we can represent 
vectors in V as column vectors: 
1 1 


Yixlej x= : |, Yyeoy= : 
x" y" 
By bilinearity, an inner product on V is determined completely by its values on a set 
of basis vectors. Let A be the n x n matrix whose entries are 


aij = (e,e;)- 
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By the symmetry of the inner product, A is a symmetric matrix. In terms of column 


vectors, 
(¥ xe, ¥.y/e;) = Yi ajjx'y! =x! Ay. 
Definition 1.4. Ann x n symmetric matrix A is said to be positive-definite if 


(i) x’ Ax > 0 for all x in R”, and 
(ii) equality holds if and only if x = 0. 


Thus, once a basis on V is chosen, an inner product on V determines a positive- 
definite symmetric matrix. 

Conversely, if A is an n x n positive-definite symmetric matrix and {e1,...,é,} 
is a basis for V, then 


(Y xe, yi y'e:) = Yi ajjx'y! =x! Ay 
defines an inner product on V. (Problem 1.1.) 
It follows that there is a one-to-one correspondence 


inner products on a vector n Xn positive-definite 
space V of dimension n symmetric matrices 


The dual space VY of a vector space V is by definition Hom(V,R), the space of 
all linear maps from V to R. Let a!,...,a” be the basis for VY dual to the basis 
€1,-.-,€n for V. If x = Sx'e; € V, then a(x) =x!. Thus, with x = Dx/e;, y= Dy/e;, 
and (e;,e;) = aij, one has 


(x,y) = Yi ajjx'y! = Ya; jo (x)o/(y) 
= aj; (a! @ a/)(x,y). 
So in terms of the tensor product, an inner product ( , ) on V may be written as 
(,)= Yiajo' @ a, 


where [a;;] is an n x n positive-definite symmetric matrix. 


1.3 Riemannian Metrics 


Definition 1.5. A Riemannian metric on a manifold M is the assignment to each 
point p in M of an inner product ( , ), on the tangent space 7,,M; moreover, the 
assignment p++ (, )» is required to be C® in the following sense: if X and Y are 
C™ vector fields on M, then p++ (Xp,Yp)p is aC” function on M. A Riemannian 
manifold is a pair (M,(, )) consisting of a manifold M together with a Riemannian 
metric (, ) on M. 


The length of a tangent vector v € T,M and the angle between two tangent vectors 
u,v € T,M on a Riemannian manifold are defined by the same formulas (1.1) and 
(1.2) as in R?. 
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Example 1.6. Since all the tangent spaces T,,R” for points p in R” are canonically 
isomorphic to IR”, the Euclidean inner product on R” gives rise to a Riemannian 
metric on R”, called the Euclidean metric on R". 


Example 1.7. Recall that a submanifold M of a manifold N is said to be regular if 
locally it is defined by the vanishing of a set of coordinates [21, Section 9]. Thus, 
locally a regular submanifold looks like a k-plane in R”. By a surface M in R? 
we will mean a 2-dimensional regular submanifold of R?. At each point p in M, 
the tangent space 7;,M is a vector subspace of T,R?. The Euclidean metric on R? 
restricts to a function 


(, )u: TyM x TyM > R, 


which is clearly positive-definite, symmetric, and bilinear. Thus a surface in R? 
inherits a Riemannian metric from the Euclidean metric on R?. 


Recall that if F: N — M is aC map of smooth manifolds and p € N is a point 
in N, then the differential F,: T,N — Ty:,)M is the linear map of tangent spaces 
given by 

(F.Xp)g = X,(g ° F) 


for any X, € T,N and any C” function g defined on a neighborhood of F(p) in M. 


Definition 1.8. AC” map F: (N,(, )’) > (M,(, )) of Riemannian manifolds is 
said to be metric-preserving if for all p € N and tangent vectors u,v € T,N, 


(u,V)) = (Fat, Fv) p(p)- (1.3) 
An isometry is a metric-preserving diffeomorphism. 


Example 1.9. If F: N — M is a diffeomorphism and ( , ) is a Riemannian metric on 
M, then (1.3) defines an induced Riemannian metric (, )’ on N. 


Example 1.10. Let N and M be the unit circle in C. Define F: N — M, a 2-sheeted 
covering space map, by F(z) = z”. Give M a Riemannian metric ( , ), for example, 
the Euclidean metric as a subspace of IR, and define ( , )/ on N by 


(v,w)' = (Fv, Few). 


Then (, )/ is a Riemannian metric on N. The map F: N > M is metric-preserving 
but not an isometry because F is not a diffeomorphism. 


Example 1.11. A torus in R°? inherits the Euclidean metric from R*. However, a 
torus is also the quotient space of R? by the group Z? acting as translations, or to 
put it more plainly, the quotient space of a square with the opposite edges identi- 
fied (see [21, §7] for quotient spaces). In this way, it inherits a Riemannian met- 
ric from R?. With these two Riemannian metrics, the torus becomes two distinct 
Riemannian manifolds (Figure 1.2). We will show later that there is no isometry 
between these two Riemannian manifolds with the same underlying torus. 
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Fig. 1.2. Two Riemannian metrics on the torus. 


1.4 Existence of a Riemannian Metric 


A smooth manifold M is locally diffeomorphic to an open subset of a Euclidean 
space. The local diffeomorphism defines a Riemannian metric on a coordinate open 
set (U,x!,...,x”) by the same formula as for R”. We will write 0; for the coordinate 
vector field 0/dx!. If X = Ya'o; and Y = Y b/d );, then the formula 


Gy = yah (1.4) 


defines a Riemannian metric on U. 

To construct a Riemannian metric on M one needs to piece together the Rieman- 
nian metrics on the various coordinate open sets of an atlas. The standard tool for 
this is the partition of unity, whose definition we recall now. A collection {Sq} of 
subsets of a topological space S is said to be locally finite if every point p € S has a 
neighborhood U, that intersects only finitely many of the subsets Sy. The support of 
a function f: S — R is the closure of the subset of S on which f F 0: 


supp f =cl{x € S| f(x) £0}. 


Suppose {Uq}qea is an open cover of a manifold M. A collection of nonnegative 
C™ functions 
Pa: MR, aeéA, 


is called a C® partition of unity subordinate to {Uq} if 


(i) supp Pa C Ug for all a, 
(ii) the collection of supports, {supp Pa }aca, is locally finite, 


(ii) Seca Pa = 1. 
The local finiteness of the supports guarantees that every point p has a neighborhood 


U, over which the sum in (iii) is a finite sum. (For the existence of a C™ partition of 
unity, see [21, Appendix C].) 


Theorem 1.12. On every manifold M there is a Riemannian metric. 


Proof. Let {(Uo,o,)} be an atlas on M. Using the coordinates on Ug, we define 
as in (1.4) a Riemannian metric ( , )g on Ug. Let {pq} be a partition of unity sub- 
ordinate to {Ug}. By the local finiteness of the collection {supp Pq}, every point 
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p has a neighborhood U, on which only finitely many of the p,’s are nonzero. 
Thus, )Pa(, )« is a finite sum on U,. By Proposition 1.3, at each point p the 
sum > Po(, )o is an inner product on T,M. 

To show that ¥pa(, )g is C°, let X and Y be C® vector fields on M. Since 
YPa(X,Y)q is a finite sum of C* functions on U,, it is C* on U,. Since p was 
arbitrary, Y Pa(X,Y)o is C? on M. 


Problems 


1.1* Positive-definite symmetric matrix 
Show that if A is an n x n positive-definite symmetric matrix and {e1,...,én} is a basis for V, 


then 
(Xe: Dy'ei) = ¥ aijx'y! = x’ Ay 
defines an inner product on V. 


1.2 Inner product 
Let V be an inner product space with inner product (, ). For u,v in V, prove that (u,w) = (v,w) 
for all w in V if and only if u = v. 


1.3. Restriction of an inner product to a subspace 
Prove Proposition 1.2. 


1.4* Positive linear combination of inner products 
Prove Proposition 1.3. 


1.5 Extending a vector to a vector field 
Let M be a manifold. Show that for any tangent vector v € TM, there is a C™ vector field X 
on M such that Xp = v. 


1.6 Equality of vector fields 
Suppose (M,(, )) is a Riemannian manifold. Show that two C® vector fields X,Y € X(M) 
are equal if and only if (X,Z) = (Y,Z) for all C® vector fields Z € ¥(M). 


1.7 Upper half-plane 
Let 


H? = {(x,y) €R? | y > 0}. 
At each point p = (x,y) € H?, define 
(, me: TpH? x TpH? > R 
by 
1 
(u,v) pp = p(s): 


where ( , ) is the usual Euclidean inner product. Show that ( , )pe is a Riemannian metric 
2 
on Hi’. 
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1.8. Product rule in R” 
If f,g: R— R" are differentiable vector-valued functions, show that (f,g): R > R is differ- 
entiable and 


(f.8) = (f'.8) + (f.8’). 
(Here f’ means df /dt.) 


1.9. Product rule in an inner product space 
An inner product space (V,{ , )) is automatically a normed vector space, with norm 
\|v|| = \/(v,v). The derivative of a function f: R — V is defined to be 


fe+h)—fY) 
h ’ 


f(t) = lim 


h-0 


provided that the limit exists, where the limit is taken with respect to the norm || ||. If f,g: 
R— V are differentiable functions, show that (f,g): R — R is differentiable and 


(f.g) = (figy+ (fig). 
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$2 Curves 


In common usage a curve in a manifold M can mean two things. Either it is a 
parametrized curve, i.e., a smooth map c: [a,b] — M, or it is the set of points in M 
that is the image of this map. By definition, a smooth map on a closed interval is a 
smooth map on some open set containing the interval. For us, a curve will always 
mean a parametrized curve. When the occasion calls for it, we will refer to the image 
of a parametrized curve as a geometric curve. 

A regular curve is a parametrized curve whose velocity is never zero. A regular 
curve can be reparametrized by arc length. In this section we define the signed 
curvature of a regular plane curve in terms of the second derivative of its arc length 
parametrization. 


2.1 Regular Curves 


Definition 2.1. A parametrized curve c: [a,b] — M is regular if its velocity c’(t) 
is never zero for all ¢ in the domain [a,b]. In other words, a regular curve in M is an 
immersion: [a,b] > M. 


Example 2.2. The curve c: [—1,1] > R?, 


is not regular at t = O (Figure 2.1). This example shows that the image of a smooth 
nonregular curve need not be smooth. 


Fig. 2.1. A nonregular curve. 


If t = t(u) is a diffeomorphism of one closed interval with another, then B (u) := 
c(t(u)) is a reparametrization of the curve c(t). The same geometric curve can have 
many different parametrizations. Among the various reparametrizations of a regular 
curve, the most important is the arc length parametrization. 
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As in calculus, we define the speed of a curve c: [a,b] > M in a Riemannian man- 
ifold M to be the magnitude ||c’(t)|| of its velocity c’(t), and the arc length of the 


curve to be 5 
i=) 1c! (u)|| du. 
a 


For each ¢ € [a,b], let s(t) be the arc length of the curve c restricted to [a,1]: 


= [lela 


The function s: [a,b] — [0,£] is the arc length function of the curve c. By the fun- 
damental theorem of calculus, the derivative of s with respect to t is s’(t) = ||c’(t)||, 
the speed of c. 


Proposition 2.3. The arc length function s: [a,b] > [0,€| of a regular curve 
c: [a,b] > M has a C® inverse. 


Proof. Because c(t) is regular, s(t) = ||c’(t)|| is never zero. Then s’(t) > 0 for 
all t. This implies that s(t) is a monotonically increasing function, and so has an 
inverse t(s). By the inverse function theorem, t is a C® function of s. 


Thus, given a regular curve c(t), we can write t as a C™ function of the arc length 
s to get the arc length parametrization y(s) = c(t(s)). 


Proposition 2.4. A curve is parametrized by arc length if and only if it has unit speed 
and its parameter starts at 0. 


Proof. As noted above, the speed of a curve c: [a,b] + M can be computed as the 
rate of change of the arc length s with respect to ¢ € [a,b]: 


ds 
i = 
lee S. 


Let y(s) be the arc length reparametrization of c. Since s(a) = 0, the parameter s 
starts at 0. By the chain rule, the velocity of y is 


7 (s) =e'(t(s))t"(s). 
Hence, the speed of y is 


ds 
dt 


dt| _ 
ds| 


ds dt 


IIs) Il = MeO) I) | = Tala 


Conversely, if a curve c(t) has unit speed, then its arc length is 


t t 
=e |cw)ldu= | tdu=ta. 
a a 


If a = 0, then s =t. So a unit-speed curve starting at t = 0 is parametrized by arc 
length. 
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Example 2.5. The curve c: [0,27] — R?, 
c(t) = (acost,asint), a>0, 


is regular. Its image is the circle of radius a centered at the origin. Its arc length 


function is 
t t a - 
if |cwldu= [ | fe au 
(0) 0 acosu 


t 
=| adu= at. 
0 


Hence, t = s/a and the arc length parametrization is 


AY 


y(s) = (acos * asin=) : 


a a 


2.3 Signed Curvature of a Plane Curve 


We all have an intuitive idea of what curvature means. For example, a small circle 
appears to curve more than a large circle. In this section, we will quantify the notion 
of curvature for a curve in the plane R?. 

Our plane curve y: [0,¢] —> R? will be parametrized by the arc length s. Then the 
velocity vector T(s) = y’(s) has unit length and is tangent to the curve at the point 
p=yY(s). A reasonable measure of curvature at p is the magnitude of the derivative 


dT 
'(s)==(s) =7'9), 

since the faster T changes, the more the curve bends. However, in order to distinguish 
the directions in which the curve can bend, we will define a curvature with a sign. 

There are two unit vectors in the plane perpendicular to T(s) at p. We can choose 
either one to be n(s), but usually n(s) is chosen so that the pair (T(s),n(s)) is ori- 
ented positively in the plane, i.e., counterclockwise. 

Denote by (, ) the Euclidean inner product on R?. Since T has unit length, 


(T,T) =1. 
Using the product rule (Problem 1.8) to differentiate this equation with respect to s 
gives 
(7",T) +(2,T') =0, 
or 
27.7) =0. 


Thus, 7’ is perpendicular to T and so it must be a multiple of n. The scalar K such 
that 
T’=kn 
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is called the signed curvature, or simply the curvature, of the plane curve at 
p = y(s). We can also write 


K= (T',n) — (7",n). 
The sign of the curvature depends on the choice of n; it indicates whether the curve 
is bending towards n or away from n (Figure 2.2). 


Kk>0 


Fig. 2.2. The sign of the curvature k. 


Example 2.6 (The circle). By Example 2.5, the circle of radius a centered at the origin 
has arc length parametrization 


y(s) = (acos =,asin 7) , O<s<2na. 
a a 


The unit tangent vector to the curve is 


Its derivative is 


Tisy=V"s)=| 4 8 


We choose the unit normal n so that the pair (T,n) is oriented counterclockwise. 
This means n is obtained from T by multiplying by the rotation matrix 


tela a 
rot (4) = 2 2) jO-1 
2/ |. a xz| {1 O}° 
sin— cos— 
2 2, 
Hence, 
cos — 
0-1 a 
o= (i T= _ Ss 
—sin— 
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So T' = (1/a)n, which shows that the signed curvature K(s) of a circle of radius a is 
1/a, independent of s. This accords with the intuition that the larger the radius of a 
circle, the smaller the curvature. 


2.4 Orientation and Curvature 


A curve whose endpoints are fixed has two possible arc length parametrizations, 
depending on how the curve is oriented. If the arc length of the curve is @, then the 
two parametrizations y(s), 7(s) are related by 


Hs) =(e—5). 
Differentiating with respect to the arc length s gives 
T(s) =7(s) =-Y(¢-8) =-T(l-s) and 7"(5)=7"(E-8), 
Rotating the tangent vector T(s) by 90° amounts to multiplying T(s) on the left 
by the rotation matrix rot(2/2). Thus, 


ii(s) = rot (5) T(s) = rot (7) T(€—s) =—n(€—s). 
It follows that the signed curvature of 7 at 7(s) = y(€—s) is 
K(s) = (7"(s),i(s)) = (y"(€—s), —m(€— 8)) = —K (£5). 


In summary, reversing the orientation of a plane curve reverses the sign of its signed 
curvature at any point. 


Se 


Sh 


Counterclockwise circle Clockwise circle 


Fig. 2.3. Reversing the orientation of a curve changes the sign of the curvature at a point. 


Example 2.7. The counterclockwise circle 
y(s) = (acoss/a,asins/a), O<s<2za, 
has signed curvature 1/a, while the clockwise circle 
Hs)=1(-s), 0<s<2na, 


has signed curvature —1/a. Geometrically, the unit tangent vector T of the counter- 
clockwise circle turns towards the normal vector n, while the unit tangent vector T 
of the clockwise circle turns away from the normal vector fi (Figure 2.3). 
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Problems 


2.1 Signed curvature 
Let T(s) be the unit tangent vector field on a plane curve y(s) parametrized by arc length. 


Write “iets 
70)= [iret 


where 0(s) is the angle of T(s) with respect to the positive horizontal axis. Show that the 
signed curvature « is the derivative d@/ds. 


2.2. Curvature of a unit-speed plane curve 
Suppose y(s) = (x(s),y(s)) is a unit-speed curve in the plane. Write x’ = x’(s) = dx/ds and 


y' =y'(s) = dy/ds. 


(a) Show that the signed curvature kK of y(s) is —v’/y’ when y’ 40 and y”/x’ when x’ 4 0. 
(b) For a unit-speed curve, 6 = tan~!(y'/x’). By calculating d0/ds, show that the signed 
curvature is K = xy" — xy’. (If x’ = 0, then because the curve is unit speed, y’ 4 0 and 


6 =tan~!(+o0) = 1/2.) 


2.3. Curvature of a regular plane curve 
In physics literature, it is customary to denote the derivative with respect to time by a dot, 
e.g., X = dx/dt, and the derivative with respect to distance by a prime, e.g., x’ = dx/ds. We 
will sometimes follow this convention. Let c(t) = (x(t), y(t)) be a regular plane curve. Using 
Problem 2.1 and the chain rule d@/ds = (d6@/dt)/(ds/dt), show that the signed curvature of 
the curve c(t) is 

<- 8 
( x2 ae y2)3 (2? 
where x = dx/dt and * = d?x/dt?. 


2.4. Curvature of a graph in the plane 
The graph of a C® function y = f(x) is the set 


{(x, f(a) |x RB} 
in the plane. Showed that the signed curvature of this graph at (i, f (x)) is 
= f(x) 
K 372° 
(1+ (7) 


2.5. Curvature of an ellipse 
The ellipse with equation x? /a” +-y? /b* = | in the (x, y)-plane (Figure 2.4) can be parametrized 
by 

x=acost, y=Dbsint, O<t< 27. 


Find the curvature of the ellipse at an arbitrary point (x,y). 


2.6. Arc length of a cuspidal cubic 
The cuspidal cubic r= y (Figure 2.4) can be parametrized by (3,17). Find its arc length 
from t = 0 tot=a. 
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Fig. 2.4. An ellipse and a cuspidal cubic 


2.7* Curvature of a space curve 

Let I be a closed interval in R. If y: 1 + R? is a regular space curve parametrized by arc length, 
its curvature « at y(s) is defined to be ||/"(s)||.! Consider the helix c(t) = (acost,asint, bt) 
in space. 


(a) Reparametrize c by arc length: y(s) = c(t(s)). 
(b) Compute the curvature of the helix at y(s). 


2.8. Curvature of a line in space 
Show that the curvature of a line in space is zero at every point. 


2.9. The Frenet-Serret formulas 
Let y: J > R?3 be a regular space curve parametrized by arc length. Then T = ¥ (s) is tangent 
to yat y(s) and has unit length. Assume that 7’(s) 4 0. 


(a) Prove that y’(s) is normal to T. 

(b) Let N be the unit vector y’(s)/||7/'(s)||. Then 7’ = KN, where « is the curvature of the 
space curve. 

(c) The unit vector B = T x N is called the binormal of y at y(s). The three vectors T,N,B 
form an orthonormal basis for R? at y(s), called the Frenet—-Serret frame (Figure 2.5). 
Prove that 

N =—xT +B 


for some real number T, which is called the torsion of the unit-speed curve y. 
(d) Prove that B’ = —tN. 


The set of equations 


T' = KN 
N' =—xT +TB 
B= —tN 


'The curvature of a space curve is always nonnegative, while the signed curvature of a 
plane curve could be negative. To distinguish the two for a space curve that happens to be 
a plane curve, one can use ky for the signed curvature. We will use the same notation Kk for 
both, as it is clear from the context what is meant. 
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Fig. 2.5. A Frenet—Serret frame. 


is called the Frenet—Serret formulas. In matrix notation, 


Tl Kk T 
N| =|-k T| IN]. 
B —T B 


Jean Frédéric Frenet (1816-1900) and Joseph Alfred Serret (1819-1885) 
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83 Surfaces in Space 


There are several ways to generalize the curvature of a plane curve to a surface. One 
way, following Euler (1760), is to consider the curvature of all the normal sections of 
the surface at a point. A second way is to study the derivative of a unit normal vector 
field on the surface. In this section we use Euler’s method to define several measure- 
ments of curvature at a point on a surface. We then state the two theorems, Gauss’s 
Theorema Egregium and the Gauss—Bonnet theorem, that will serve as guideposts in 
our study of differential geometry. We will take up the relationship between curva- 
ture and the derivative of a normal vector field in Section 5. 


3.1 Principal, Mean, and Gaussian Curvatures 


Recall that a regular submanifold of a manifold M is a subset of the manifold M 
locally defined by the vanishing of coordinate functions (see [21, Section 9] for a 
discussion of regular submanifolds). By a surface in R*, we mean a 2-dimensional 
regular submanifold of R?. Let p be a point on a surface M in R?. A normal vector to 
M at p is a vector N, € T,R? that is orthogonal to the tangent plane 7,M. A normal 
vector field on M is a function N that assigns to each p € M anormal vector N, at p. 
If N is anormal vector field on M, then at each point p € M, we can write 


a 7] 
N, — f aT . 
Pp > a'(p) ay : 
The normal vector field N on M is said to be C® if the coefficient functions a! ,a*,a? 


are C™ functions on M. 


i 


Fig. 3.1. Normal section at p. 


Let N be a C™ unit normal vector field on a neighborhood of p in M. Denote by 
Np the unit normal vector at p. Under the canonical identification of T,R? with R3, 
every plane P through N, slices the surface M along a plane curve PM through p. 
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By the transversality theorem from differential topology, the intersection PM, be- 
ing transversal, is smooth (see Problem 3.2). We call such a plane curve a normal 
section of the surface through p (Figure 3.1). Assuming that the normal sections 
have C™ parametrizations, which we will show later, we can compute the curvature 
of a normal section with respect to Np. The collection of the curvatures at p of all 
the normal sections gives a fairly good picture of how the surface curves at p. 

More precisely, each unit tangent vector X, to 
the surface M at p determines together with N, a 
plane that slices M along a normal section. More- 
over, the unit tangent vector X, determines an ori- 
entation of the normal section. Let y(s) be the arc 
length parametrization of this normal section with ini- 
tial point y(0) = p and initial vector y/(0) = Xp. Note 
that y(s) is completely determined by the unit tangent 
vector X,. Define the normal curvature of the normal 
section y(s) at p with respect to N, by 


K(Xp) = (Y'(0),Np). (3.1) 


Of course, this N, is not always the same as the n(0) 
in Section 2, which was obtained by rotating the unit 
tangent vector 90° counterclockwise in R*; an arbi- 
trary plane in R* does not have a preferred orienta- (1707-1783) 
tion. 

Since the set of all unit vectors in 7,,M is a circle, 
we have a function 


Leonhard Euler 


(Portrait by Jakob Emanuel 
Handmann, 1753) 
Kk: S' OR. 
Clearly, k(—X,) = K(X,) for X, € S', because replacing a unit tangent vector by 
its negative simply reverses the orientation of the normal section, which reverses the 
sign of the first derivative y/(s) but does not change the sign of the second derivative 
7'(s). 

The maximum and minimum values Kk, k2 of the function K« are the principal 
curvatures of the surface at p. Their average (Kk, + k2)/2 is the mean curvature H, 
and their product kj ky the Gaussian curvature K. A unit direction X, € T,M along 
which a principal curvature occurs is called a principal direction. Note that if Xp is 
a principal direction, then so is —Xp, since K(—X,) = K(X;). If «, and ky are equal, 
then every unit vector in 7,,M is a principal direction. 


Remark 3.1. Using —N, instead of Np reverses the signs of all the normal curvatures 
at p, as one sees from (3.1). This will change the sign of the mean curvature, but it 
leaves invariant the Gaussian curvature. Thus, the Gaussian curvature K is indepen- 
dent of the choice of the unit normal vector field NV. 


Example 3.2 (Sphere of radius a). Every normal section of a sphere of radius a is 
a circle of radius a. With respect to the unit inward-pointing unit normal vector 
field, the principal curvatures are both 1/a (see Example 2.6), the mean curvature is 
H = 1/a and the Gaussian curvature is K = 1/a?. 
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Example 3.3. For a plane M in R? the principal curvatures, mean curvature, and 
Gaussian curvature are all zero. 


Example 3.4. For a cylinder of radius a with a unit inward normal, it appears that the 
principal curvatures are 0 and 1/a, corresponding to normal sections that are a line 
and a circle, respectively (Figure 3.2). We will establish this rigorously in Section 5. 
Hence, the mean curvature is 1/2a and the Gaussian curvature is 0. If we use the 
unit outward normal on the cylinder, then the principal curvatures are 0 and —1/a, 
and the mean curvature is —1/2a, but the Gaussian curvature is still 0. 


yo 1 = 0 


kj == 


Fig. 3.2. Principal curvatures of a cylinder. 


3.2 Gauss’s Theorema Egregium 


Since the plane is locally isometric to a cylinder, Ex- 
amples 3.3 and 3.4 show that the principal curvatures 
K,, K2 and the mean curvature H are not isometric in- 
variants. It is an astonishing fact that although neither 
K} Nor k2 is invariant under isometries, their product, 
the Gaussian curvature K, is. This is the content of 
the Theorema Egregium of Gauss. 

Another way to appreciate the significance of this 
theorem is to think in terms of isometric embeddings. 
We may think of portions of the plane and the cylin- 
der as different isometric embeddings of the same 
planar region. The examples show that the princi- 
pal curvatures kK}, 2 and the mean curvature depend 
on the embedding. The product kj k) would a priori 
seem to depend on the embedding, but in fact does 
not. In the next few sections we will develop the ma- (1777-1855) 
chinery to prove the Theorema Egregium. 


Carl Friedrich Gauss 


(Artist: Gottlieb Biermann, 
1887) 
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3.3 The Gauss—Bonnet Theorem 


For an oriented surface M in R? the Gaussian curvature K is a function on the surface. 
If the surface is compact, we can integrate K to obtain a single number f,, K dS. Here 
the integral is the usual surface integral from vector calculus. 


Example 3.5. For the sphere M of radius a (Figure 3.3), the integral of the Gaussian 


curvature is 


| Kas= 
M 


LB] - = 


a 


= 


= 4nd’ =4n. 


Q 
tw 


Fig. 3.3. Sphere of radius a. 


1 1 
—~ dS = —~ ldS 
a ae 


(surface area of M) 


Notice that although the Gaussian curvature of a sphere depends on the radius, 
in the integration the radius cancels out and the final answer is independent of the 


radius. 


Example 3.5 is a special case of the Gauss—Bonnet 
theorem, which for a compact oriented surface M in R? 
asserts that 


[ Kas =2nx(M), (3.2) 
M 


where 7(M) denotes the Euler characteristic. Equa- 
tion (3.2) is a rather unexpected formula, for on the 
left-hand side the Gaussian curvature K depends on a 
notion of distance, but the right-hand side is a topolog- 
ical invariant, independent of any Riemannian metric. 
Somehow, in the integration process, all the metric in- 
formation gets canceled out, leaving us with a topolog- 
ical invariant. For the 2-sphere, the Euler characteristic 
is 2 and the theorem checks with the computation in 
Example 3.5. 


Pierre Ossian Bonnet 


(1819-1892) 
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In due course we will study the theory of characteristic classes for vector bundles, 
a vast generalization of the Gauss—Bonnet theorem. 


Problems 


3.1. Principal curvatures in terms of K and H 
Compute the principal curvatures x; , ky at a point of an oriented surface in R? in terms of its 
Gaussian curvature K and mean curvature H. 


3.2. Normal section at a point 

Suppose M is a smooth surface in R3, p a point in M, and N a smooth unit normal vector field 
on a neighborhood of p in M. Let P be the plane spanned by a unit tangent vector X, € T,M 
and the unit normal vector N,. Denote by C := PM the normal section of the surface M at 
p cut out by the plane P. 


(a) The plane P is the zero set of some linear function f (x,y,z). Let f: M — R be the restric- 
tion of f to M. Then the normal section C is precisely the level set f~!(0). Show that p is 
a regular point of f, i.e., that the differential f, ): Tp>M — ToR is surjective. (Hint: Which 
map is fx,p: R3 = TR? > Tr(p)R = R? What is its kernel?) 

(b) Show that a normal section of M at p is a regular submanifold of dimension one in a 
neighborhood of p. (Hint: Apply [21, Proposition 11.4] and the regular level set theorem 
(21, Theorem 9.9].) 


3.3. Regular submanifold of dimension one 

Show that if a curve C in a smooth surface is a regular submanifold of dimension one in a 
neighborhood of a point p € C, then C has a C™ parametrization near p. (Hint: Relative to an 
adapted chart (U xt x? ) centered at p, the curve C is the x!-axis. A C® parametrization of the 


x!-axis in the (x! ,x?)-plane is (x!,x*) = (t,0).) 
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$4 Directional Derivatives in Euclidean Space 


The directional derivative is one way of differentiating vector fields on R” with 
respect to a tangent vector. In this section we extend the calculus definition of the 
directional derivative of a function to the directional derivative of a vector field along 
a submanifold of R”. 


4.1 Directional Derivatives in Euclidean Space 


Suppose X, = Yad /dx'|, is a tangent vector at a point p = (p!,...,p”) in R” and 
f(x!,...,x") is a C® function in a neighborhood of p in R”. To compute the direc- 
tional derivative of f at p in the direction Xp, we first write down a set of parametric 
equations for the line through p in the direction X,: 


x = pitta’, i=1,...,n. 
Let a = (a!,...,a"). Then the directional derivative Dx, f is 
_ fetta) —f(p) _ 
Dy f _ lim t dt ee) (4.1) 
dx! ; 

=r Gna a (by the chain rule) 

P Pp 
, 0 


In calculus, X, is required to be a unit vector, but we will allow X, to be an arbitrary 
vector at p. 

As a shorthand, we write 0; for 0 /0x'. The directional derivative at p of a C* 
vector field Y = bd; = )b'd /dx! on R” in the direction X, is defined to be 


7) 


Dx, Y = >(X)b") 5 (4.2) 


P 


This formula shows clearly that Dy ¥ is R-linear in Xp. 

Although (4.1) computes the directional derivative using the values of f along a 
line through p, we can in fact use any curve c(t) with initial point p and initial vector 
X, (Figure 4.1), for 


dt 


Dx, f =Xpf c'(0) f = cs ( 
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Xp 
c(t) 
P 


Fig. 4.1. Tangent vector at a point 


Remark 4.1. Thus, for Dx, f to be defined, it is not necessary that f be defined in an 
open neighborhood of p. As long as f is defined along some curve starting at p with 
initial velocity X;, the directional derivative Dx, f will make sense. A similar remark 
applies to the directional derivative Dx, Y of a vector field Y. 


When X is a C™ vector field on R”, not just a vector at p, we define the vector 
field DyY on R” by 
(DxY)p = Dx,Y forall pe R”. 
Equation (4.2) shows that if X and Y are C™ vector fields on R”, then so is DyY. Let 
3€(IR") be the vector space of all C® vector fields on R”. The directional derivative 
in R” gives a map 


D: X(R") x X(R") > X(R"), 
which we write as DxY instead of D(X,Y). Let F¥ = C”(R”) be the ring of C” 
functions on R”. Then X(R”) is both a vector space over R and a module over F. 


Proposition 4.2. For X,Y € X(R"), the directional derivative DxY satisfies the fol- 
lowing properties: 


(i) DxY is F-linear in X and R-linear in Y; 
(ii) (Leibniz rule) if f is a C® function on R", then 


Dx(fY) = (KAY + fDxY. 
Proof. (i) Let f be a C® function on R” and p an arbitrary point of R”. Then 
(Dpx¥)p = Dyip)x,¥ 
= f(p)Dx,¥ (because Dy, Y is R-linear in X, by (4.2)) 
= (fDxY)p. 
For Z € X(R”), we have 
(Dx+zY)p _ Dx, +Z,¥ = Dx,Y +Dz,Y — (DxY +DzY)>p. 


This proves that DxY is F-linear in X. The R-linearity in Y is clear from (4.2). 
(ii) Suppose Y = ¥\b'd;, where b' € C*(R"). Then 


(Dx(fY))p = ¥X)(fb') Or| , 
= D(X, f)b'(P) Alp +L (P)Xpb! Ail, 
= (Xpf)¥p + f(p)Dx,¥ 
= ((Xf)¥ + fDxY)p. 
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4.2 Other Properties of the Directional Derivative 

Since the directional derivative D in R” is an R-bilinear map 
D: X(R") x X(R") > X(R”), 


one can ask if it is symmetric, that is, for all X,Y € X(IR"), is DyY = DyX? A simple 
calculation using the standard frame shows that the answer is no; in fact, if [X,Y] is 
the Lie bracket defined in (A.2), then 


DxY — DyX = [X,Y]. 
The quantity 
T (X,Y) = DxyY — DyX — [X,Y] 


turns out to be fundamental in differential geometry and is called the torsion of the 
directional derivative D. 

For each smooth vector field X € X(R"), the directional derivative Dy : X(R") > 
3(IR”) is an R-linear endomorphism. This gives rise to a map 


X(R") > End (X(R")), 


4.3 
X +> Dy. ( ) 


The vector space X(IR") of C® vector fields on R” is a Lie algebra under the Lie 
bracket of vector fields. For any vector space V, the endomorphism ring Endp(V) of 
endomorphisms of V is also a Lie algebra, with Lie bracket 


[A,B] =A°B-—B°eA, A,BEEnd(V). 


So the map in (4.3) is an R-linear map of Lie algebras. It is natural to ask if it is a 
Lie algebra homomorphism, i.e., is 


[Dx , Dy] = Diy. y)? 


The answer is yes for the directional derivative in R”. A measure of the deviation 
of the linear map X ++ Dy from being a Lie algebra homomorphism is given by the 
function 


R(X,Y) = [Dx , Dy] —Dx,y) = Dx Dy — Dy Dx Dixy] Endp(X(M)), 


called the curvature of D. 
Finally, one might ask if the product rule holds for the Euclidean inner product: 


Dz(X,Y) = (DzX,Y) + (X,DzY). 


The answer is again yes. The following proposition summarizes the properties 
of the directional derivative in R”. 


Proposition 4.3. Let D be the directional derivative in R"” and X,Y,Z C™ vector 
fields on R". Then 
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(i) (zero torsion) DyY — DyX — [X,Y] =0, 
(ii) (zero curvature) Dy DyZ — Dy Dy Z — Dix yjZ =0, 
(iii) (compatibility with the metric) X(Y,Z) = (DxY,Z) + (Y, DxZ). 


Proof. (i) Problem 4.2. 
(ii) Let Z = ¥c'd; € X(R"). Then 
Dx DyZ = Dx (¥(¥c')0;) = YY (X¥ c') oj. 
By symmetry, 
DyDxZ = > (YXc')9j. 


So 
DxDyZ—DyDxZ = (XY —YX)c'd; = Dy y/Z. 


(iii) Let Y = Sb'd; and Z = Y.c/d; € X(R"). Then 


XYZ) =XO/b') = kb) + ¥ (Xe), 
= (DyY,Z) + (Y,DxZ). 


If X and Y are smooth vector fields on a manifold M, then the Lie derivative 
£xY is another way of differentiating Y with respect to X (for a discussion of the 
Lie derivative, see [21, Section 20]). While the directional derivative DyY in R” is 
F-linear in X, the Lie derivative LyxY is not, so the two concepts are not the same. 
Indeed, since £yY = [X,Y], by Proposition 4.3(i), for vector fields on R", 


LyY = DxY — DyX. 


4.3 Vector Fields Along a Curve 


Suppose c: [a,b] + M is a parametrized curve in a manifold M. 


Definition 4.4. A vector field V along c: |a,b| — M is the assignment of a tangent 
vector V(t) € T.(7)M to each t € [a,b]. Such a vector field is said to be C® if for every 
C* function f on M, the function V(t) f is C” as a function of t. 


Example 4.5. The velocity vector field c'(t) of a parametrized curve c is defined by 


d 
i ee, a 
ef) (4 


) € Te). 


t 


It is a vector field along c. 


Example 4.6. If V is a vector field on M and c: [a,b] > M is a parametrized curve in 
M, then V induces a vector field V along c by 


V(t) = Vow) ‘ 
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Now suppose c: [a,b] + R” is a curve in R” and V is a C® vector field along c. 
Then V(t) can be written as a linear combination of the standard basis vectors: 


Vit) = Vv) lew): 
So it makes sense to differentiate V with respect to f: 


dV dy! 
a ae Oiler) 


which is also a C™ vector field along c. 

For a smooth vector field V along a curve c in an arbitrary manifold M, without 
further hypotheses on M, the derivative dV /dt is in general not defined. This is 
because an arbitrary manifold does not have a canonical frame of vector fields such 


as 0/dx',...,0/dx" on R". 


Proposition 4.7. Let c: [a,b] > R" be a curve in R" and let V(t),W(t) be smooth 
vector fields along c. Then 


“v(t).W() = (aw) if (¥, 7). 


Proof. Write V(t) and W(t) in terms of 0j|-() and differentiate. 


4.4 Vector Fields Along a Submanifold 


Let M be a regular submanifold of a manifold M. At a point p in M, there are two 
kinds of tangent vectors to M, depending on whether they are tangent to M or not. 
For example, if M is a surface in R*, the vectors in a tangent vector field on M are 
all tangent to M, but the vectors in a normal vector field on M are not tangent to M 
but to R3. 


Definition 4.8. Let M be a submanifold of a manifold M. A vector field X on M 
assigns to each p € M a tangent vector X, € T,M. A vector field X along M in M 
assigns to each p € M a tangent vector Xp € T,M . A vector field X along M is called 
C® if for every C® function f on M, the function Xf is C* on M. 


The distinction between these two concepts is indicated by the prepositions “on” 
and “along.” While it may be dangerous for little prepositions to assume such grave 
duties, this appears to be common usage in the literature. In this terminology, a 
normal vector field to a surface M in R? is a vector field along M in R?, but not a 
vector field on M. Of course, a vector field on M is a vector field along M. 

As in Section A.3, the set of all C” vector fields on a manifold M is denoted by 
X(M). The set of all C® vector fields along a submanifold M in a manifold M will 
be denoted by '(7M|y). They are both modules over the ring F = C*(M) of C” 
functions on M. 
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4.5 Directional Derivatives on a Submanifold of R” 


Suppose M is a regular submanifold of R". At any point p € M, if Xp € T,M isa 
tangent vector at p and Y = ¥.b'd; is a vector field along M in R", then the directional 
derivative Dey is defined. In fact, 


Dx,¥ = ¥(Xpb') dil,. 


As p varies over M, this allows us to associate to a C™ vector field X on M anda 
C™ vector field Y along M a C™ vector field DyY along M by 


DxY = > (Xb')d; €T(TR"|y). 


For any p € M, we have (DxY)p = Dx,Y by definition. It follows that there is an 
R-bilinear map 


D: ¥(M) x T(TR"|y) 3 T(TR"|u) 
D(X,Y) = Dx’. 


We call D the directional derivative on M. Because of the asymmetry between X 
and Y—one is a vector field on M, the other a vector field along M, the torsion 
T (X,Y) := DyY — DyX — [X,Y] no longer makes sense. Otherwise, D satisfies the 
same properties as the directional derivative D on R”. 


Proposition 4.9. Suppose M is a regular submanifold of RR” and D is the directional 
derivative on M. For X © X(M) andY €T(TR"|m), 


(i) D(X,Y) = DyY is F-linear in X and R-linear in Y; 
(ii) (Leibniz rule) if f € C*(M), then 


D(X, fY) = Dx(f¥) = (Xf)¥ + (DxY. 


Proposition 4.10. Suppose M is a regular submanifold of R" and D is the directional 
derivative on M. 


(i) (zero curvature) If X,Y € X(M) and Z © T(TR"|m), then 
DxDyZ —DyDxZ—Dy |Z =0. 

(ii) (compatibility with the metric) If X © X(M) and Y,Z € T(TR"|y), then 
X(Y,Z) = (DxY,Z) + (Y,DyZ). 


Proof. By Remark 4.1, these are proven in exactly the same way as Proposition 4.3. 


Suppose V is a C® vector field along a regular submanifold M in R” and 
c: [a,b] > M is aC” curve in M. Then V induces a vector field V along c: 


V(t) = Vow) € T.q)R". 
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Proposition 4.11. Differentiation with respect to t of a vector field along a curve is 
the directional derivative in the tangent direction: 


Proof. Let 


Since V(t) = V.q), 


By the definition of c’(t), 


d d d 
Ny) si O | ete en) 
di =e( 2) P= (Poe) = Fy 
Therefore, 
DeyV = YC ()) Ailey (by (4.2) 
ya le) = ad 
Le de ct) dt 
Problems 


4.1. Lie bracket of vector fields 
Let f,g be C® functions and X,Y be C™ vector fields on a manifold M. Show that 


[fX,8Y] = fg[X,Y] + f(Xg)¥ —g(Vf)X. 


(Hint: Two smooth vector fields V and W on a manifold M are equal if and only if for every 
hec*(M), Vh=Wh.) 


4.2. Directional derivative in R” 
Prove Proposition 4.3(i). 


4.3. Directional derivative on a submanifold 
Let M be a regular submanifold of IR” and 


D: ¥(M) x T(TR"|y) + T(TR" |) 


the directional derivative on M. Since X(M) C I(TR"|y), we can restrict D to X(M) x X(M) 
to obtain 
D: X(M) x X(M) > T(TR"|y). 


(a) Let T be the unit tangent vector field to the circle $ ! Prove that DrT is not tangent to $ f 
This example shows that D| (14) x() does not necessarily map into X(M). 
(b) If X,Y € X(M), prove that 


DxY —DyX = [X,Y]. 
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To define the curvature of a plane curve in Section 2, we differentiated its unit tangent 
vector field with respect to arc length. Extrapolating from this, we can try to describe 
the curvature of a surface in R° by differentiating a unit normal vector field along 
various directions. 


5.1 Normal Vector Fields 


Recall that a point p of R” is a regular point of a function f: R” — R if the differen- 
tial fp: Tp»R”" — Ty(p)R is surjective, equivalently, if at least one partial derivative 


df /Ax'(p) is nonzero. A point q € R is a regular value if its inverse image f—'(q) 
consists entirely of regular points; otherwise, it is a singular value. 

A hypersurface in R” is the zero set Z(f) of aC” function f on R”. Consider a 
hypersurface M in R? defined as the zero set of the C® function f (x,y,z). We assume 
that the partial derivatives f,, f;, f, do not vanish simultaneously on M. Then 0 is a 
regular value of f, and M = f—'(0) is a regular level set. By the regular level set 
theorem, M is a regular submanifold of R°?. 

As a submanifold of R*, M inherits a Riemannian metric from the Euclidean 
metric on R?. Let N = grad f = (f,, fy, fz) be the gradient vector field of f on M. 


Proposition 5.1. /f0 is a regular value of the C® function f on R?, then N = grad f is 
a nowhere-vanishing normal vector field along the smooth hypersurface M = Z(f). 


Proof. Let p € M and X, € T,M. It suffices to show that (N,,X,) = 0. Choose a 
curve c(t) = (x(t), y(t),z(t)) in M with c(0) = p and c’(0) = X, (such a curve always 
exists, for example by [21, Proposition 8.16]). Since c(t) lies in M, f(c(t)) = 0 for 
all t. By the chain rule, 


d 


0= — 
dt | _0 


F(e(t)) = fe(p)x'(0) + fy(P)y'(0) + e(p)z'(0). 


Hence, 


(Np.Xp) = (grad f(p),!(0)) =0. 


By dividing grad f by its magnitude, we obtain a smooth unit normal vector field 
along the hypersurface M. Since a smooth surface in R°, orientable or not, is locally 
the zero set of a coordinate function, it follows from this proposition that a smooth 
unit normal vector field exists locally along any smooth surface in R?. 
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M 


Fig. 5.1. A normal vector field near p. 


5.2 The Shape Operator 


Let p be a point on a surface M in R? and let N be a C® unit normal vector field on 
M (Figure 5.1). For any tangent vector X, € T,M, define 


L,(Xp) = —Dy,N. 


This directional derivative makes sense, since N is defined along some curve with 
initial vector X,. We put a negative sign in the definition of L, so that other formulas 
such as Lemma 5.2 and Proposition 5.5 will be sign-free. Applying the vector X, to 
(N,N) = 1 gives 


0=X,(N,N) 
= (Dx,N Np) + (Np, Dx,N ) (compatibility with the metric, Proposition 4.3) 
= 2(Dy,N,Np). 


Thus, Dy,N is perpendicular to N, at p and is therefore in the tangent plane 7,M. So 
Lp is a linear map 7,M — T,M. It is called the shape operator or the Weingarten 
map of the surface M at p. Note that the shape operator depends on the unit normal 
vector field N and the point p. With the unit normal vector field N on M fixed, as 
the point p varies in M, there is a different shape operator L, at each p. To avoid 
cumbersome notation, we will usually omit the subscript p. 

The shape operator, being the directional derivative of a unit normal vector field 
on a surface, should encode in it information about how the surface bends at p. 


Lemma 5.2. Let M be a surface in R* having a C® unit normal vector field N. For 
X,Y © X(M), 
(L(X),Y) = (Dx¥,N). 


Proof. Since Y is tangent to M, the inner product (Y,N) is identically zero on U. 
Differentiating the equation (Y,N) = 0 with respect to X yields 


0 =X(Y,N) = (DxY,N) +(Y,DxN) 
= (DxY,N) — (Y,L(X)). 


Hence, 


(L(X),Y) = (DxY,N). 
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In general, if e;,...,e@, is a basis for a vector space V and 
v= > v'e€V, with v' ER, (5.1) 
i 
then v’ is called the component of v in the e;-direction. In case e1,...,é, is an ortho- 


normal basis in an inner product space (V,(, )), by taking the inner product of both 
sides of (5.1) with e;, one obtains 


(v,e;) = Vivile,e;) = hb, =v. 


Thus, the component v/ of v in the e j-direction is simply the inner product of v 
with e;. In view of this, the right-hand side of Lemma 5.2 is the normal component 
of the directional derivative Dy, Y. 

Note that any tangent vector X, € T,M can be extended to a C™ vector field in 
a neighborhood of p: if (U,x!,...,x”) is a chart containing p and X, = Ya'd/dx'|, 
for some a! € R, then X = Ya‘ d/dx' is such a vector field on U. 


Proposition 5.3. The shape operator is self-adjoint: for any Xp,Yp € T,M, 
(L(Xp),¥p) = (Xp, L(¥p))- 


Proof. Suppose the C™ unit normal vector field N is defined on a neighborhood U 
of p. Let X,Y be vector fields on U that extend the vectors Xp, Y, at p. By Lemma 5.2, 


(L(X),¥) = (Dx¥,N). (5.2) 
Similarly, reversing the roles of X and Y, we have 
(L(Y),X) = (DyX,N). (5.3) 


By Problem 4.3(b), 
DxY — DyX = [X,Y]. (5.4) 


Combining the three preceding equations (5.2), (5.3), and (5.4), we get 
(L(X),Y) — (L(¥),X) 
= (DxY,N) — (DyX,N) 
( 
([X,¥],N) (by (5.4)) 
=0 (since [X,Y] is a tangent vector field on M). 


Hence, 
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If T: V — W is a linear map between two vector spaces V and W, By = 
{v1,...,V¥n} a basis for V, and By = {w1,...,Wm} a basis for W, then 


m 


TWv)= Yai, j=l,...5n, 
i=1 


for a unique matrix [a‘], called the matrix of the linear map T with respect to the 
bases By and By. 

It follows from Proposition 5.3 that the matrix of the shape operator with respect 
to an orthonormal basis for 7M is symmetric, for if e),e2 is an orthonormal basis 
for T,M and 


L(e,) = ae, +be2, 


L(e2) = he, +ce2, 


then 
b = (L(e1),€2) = (e1,L(e2)) =h. 


Since the eigenvalues of a symmetric matrix are real (Problem 5.1), the shape 
operator has real eigenvalues. We will see shortly the meaning of these eigenvalues. 


5.3 Curvature and the Shape Operator 


Consider as before a surface M in R? having a C® unit normal vector field N. 
Lemma 5.2 on the shape operator has a counterpart for vector fields along a curve. 


Proposition 5.4. Let c: [a,b] — M be a curve in M and let V be a vector field in M 
along c. Then 


(L(c'(t)),V) = (2 Nan): 


Remark. When we write V or dV /dt, we mean V(t) and dV (t) /dt, respectively. 


Proof. Since V(t) is tangent to M, the inner product (V(t),N.(:)) is identically zero. 
By Proposition 4.7, differentiating with respect to ¢ yields 


d 
0= (V(t), New) 


Thus, 


(VO). L(C(0)) = WO). —Dew) = (Ney) 
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Proposition 5.5. Suppose y(s) is a normal section, parametrized by arc length, det- 
ermined by a unit tangent vector X, € T,M and the unit normal vector Ny. Then the 
normal curvature of y(s) with respect to Ny at p is given by the second fundamental 
form: 

(Xp) = (L(Xp), Xp) = (Xp, Xp). 
Proof. By definition, y(0) = p and y (0) = Xp. Let T(s) := 7'(s) be the unit tangent 
vector field along y(s). Then the curvature of the normal section y(s) is 


K(7(s)) = (7"(s),Nys)) 
= (dT /ds, 9) 
= (L(/(s)),7) (by Proposition 5.4) 
=(L(T),T). 


Evaluating at s = 0 gives 


(Xp) = (L(Xp),Xp) = I(Xp, Xp). 


Proposition 5.6. The principal directions of the surface M in R? at p are the unit 
eigenvectors of the shape operator L; the principal curvatures are the eigenvalues 
of L. 


Proof. The principal curvatures at p are the maximum and minimum values of the 
function 
(Xp) = I(Xp, Xp) = (L(Xp) Xp) 


for Xp € TM satisfying (Xp,Xp) = 1. This type of optimization problem with a 
constraint lends itself to the method of the Lagrange multiplier from vector calculus. 
Choose an orthonormal basis e; ,é2 for 7,M so that 


Xp = xe, +ye2 = i : 


By Proposition 5.3 the matrix of L relative to the basis e),e2 is a symmetric matrix 
ab 
= ~ | 
be 


L(e,) = ae, +be2, 


meaning 


L(e2) = be; + cep. 


In matrix notation, 


L(Xp) = L(xe; + ye) = (ax+by)e; + (bx +cy)ez 


_ jaxt+by| — jab} |x} _ 
— bese _ k i f =A 
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and 
K(Xp) = (L(Xp),Xp) = (AX,,Xp) = X, AX, 
= ax” + 2bxy+ cy’. 


The problem of finding the principal curvatures becomes a standard calculus 
problem: find the maximum and minimum of the function 


K(X») = ax + 2bxy + cy? = Xp AX, 
subject to the constraint 


g(Xp) = (Xp,Xp) =f +y =. 


Now 
_ |2ax+2by| _ 
grad kK = E hea ed = 2AX) 
and 
grad g = 2X). 


By the method of the Lagrange multiplier, at the maximum or minimum of k, there 
is a scalar A such that 


grad k =Agradg, or AX, =AXp. 


Thus, the maximum and minimum of k occur at unit eigenvectors of A. These are 
the principal directions at p. 

Let X, be a principal direction at p. Then the corresponding principal curvature 
is the normal curvature 


(Xp) = (L(Xp),Xp) = (AXp,Xp) = (AXp,Xp) = A (Xp, Xp) =A, 


the eigenvalue associated to the eigenvector Xp. 


Corollary 5.7. (i) The Gaussian curvature of a surface M in R? is the determinant 
of the shape operator. 

(ii) Ife), €2 is an orthonormal basis for the tangent space T,M of the surface M, then 
the Gaussian curvature at p is 


K = (L(e1),e1)(L(e2),€2) — (L(e1),€2) (L(@2);€1)- 


Proof. (i) The determinant of a linear map is the product of its eigenvalues (Prob- 
lem 5.2). For the shape operator L the eigenvalues are the principal curvatures kj, K2. 
So detL = ki k) = K, the Gaussian curvature. 
(ii) If 
L(e;) =ae;+be, and Le.) = be, +ce2, 
then 
a= (L(e1),€1); b= (L(e1),€2) = (L(e2),€1); C= (L(e2),€2). 
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The matrix of L relative to the orthonormal basis e 1, é2 is 
Fe 
bec|- 
Hence, 


detL = ac—b* 
= (L(e1),e1) (L(e2),€2) — (L(e1), €2) (L(e2),€1)- 


5.4 The First and Second Fundamental Forms 


A point p of a smooth surface M in R?, the Euclidean Riemannian metric is a sym- 
metric bilinear form on the tangent space 7,,M. It is called the first fundamental 
form of M at p. If L: T,M — T,M is the shape operator, the symmetric bilinear form 
on T,M 
(Xp, Y,) = (L(Xp), Y,) 
is called the second fundamental form of the surface M at p. The first fundamental 
form is the metric and the second fundamental form is essentially the shape operator. 
These two fundamental forms encode in them much of the geometry of the surface M. 
Let e1,é2 be a basis for the tangent space T,M. We set 


E:=(e1,e1), F:=(e1,e2), G:= (e2,e2). 
If Xp =xle,+27e and Y, = y'e, + yeo, then 
(Xp,Yp) = (e1,e1)x'y! + (e1,e2)x!y? 


+ (€2,¢1)x°y! + (e2,€2)x°y" 
= Ex'y! + Fx'y? +Fx’y! + Gry 


EI 


Thus, the three numbers FE, F,G determine completely the first fundamental form of 
M at p. They are called the coefficients of the first fundamental form relative to 
€1,€2. 

Similarly, the three numbers 


e:=II(e1,e1), f:=W(e1,e2),  g:= I (e2,e2) 


determine completely the second fundamental form of M at p: 


aaa lah 


36 §5 The Shape Operator 


They are called the coefficients of the second fundamental form relative to e),e2. 
As p varies in an open set U, if e},e2 remain a basis of 7,,M at each point, these 
coefficients are six functions on U. Classically, M is taken to be a coordinate patch 
(U,u,v) with e; = 0/du and ep = 0/dv, and the differential geometry of M is done 
in terms of the six functions E, F,G,e, f,g. 


Theorem 5.8. Suppose M and M' are two smooth Riemannian manifolds of dimen- 
sion 2, and ~p: M > M' is a diffeomorphism. Let E, F,G be the coefficients of the first 
fundamental form relative to a frame e,,e2 on M, and E',F',G' the corresponding 
coefficients relative to e := Qx€1, & := @,.e2 on M'. Then @ is an isometry if and 
only if E,F,G at p are equal to E',F',G' at 9(p), respectively, for all p € M. 


Proof. The diffeomorphism @ is an isometry if and only if 
(0,Xp, PY p) o(p) — (Xp, a: 
This condition holds if and only if 
(Qx€i; Px€j) o(p) = (ei, Cj) ps 
for all i= 1,2 and j = 1,2, ie., 


E'(g(p)) =E(p), F'(@(p)) =F(p), G'(e(p)) = G(p). 


5.5 The Catenoid and the Helicoid 


The graph of the hyperbolic cosine function 


e“te~* 
— h — 
y =coshx 5 
is called a catenary (Figure 5.2). 
y Zz 
See | 1+ 
1 
| r 

| | x =I 1 

—l 1 = eo 


y =coshx 


Fig. 5.2. Catenaries 
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Fig. 5.3. A hanging chain 


Using physics, it can be shown that a hanging chain naturally assumes the shape 
of a catenary (Figure 5.3). 

The surface of revolution obtained by rotating the catenary r = coshz about the 
z-axis is called a catenoid (Figure 5.4). It has parametrization 


(rcos @,rsin@,z) = ((coshu) cos @, (coshu) sin 8, u), 


where we set z = u. 


Fig. 5.4. A catenoid 


The helicoid is the surface with parametrization 


(rcos6,rsin0,6), —°0<1r0 <9; 
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it is the surface traced out by a horizontal stick moving upward with constant speed 
while rotating about a vertical axis through its midpoint (Figure 5.5). 


Fig. 5.5. A helicoid 


The catenoid shown in Figure 5.4 has 


-l<u<l O0<0<2z7. 


If we remove from this catenoid the points with vu = —1,1 and 0 = 0,27, then what 
is left is a coordinate chart U. 
Similarly, the helicoid in Figure 5.5 has 


-—a<r<a, 0<0<22 


for some positive real number a. We remove from it points with r=0,a and 0 =0,27 
to obtain a coordinate chart U’. 
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As coordinate charts, there are diffeomorphisms U ~ (—1,1) x (0,27) and U’ ~ 
(—a,a) x (0,27). Since (—1,1) is diffeomorphic to (—sinh1,sinh1) via the map 
ut++ sinhh, there is a diffeomorphism @: U — U’ given by 


((coshu) cos 8, (coshw) sin@,u) ++ ((sinhw) cos @, (sinhw) sin @, @). 


By computing the three functions E,F,G relative to the frame e; = 0/du, 
e2 = 0/00 on the catenoid and the frame e} = ,e1, e) = @,e2 on the helicoid, 
respectively, one can show that @: U — U’ is an isometry (Problem 5.11). Physi- 
cally, what this means is that if one cuts the catenoid in Figure 5.4 along a meridian, 
straighten out the meridian, and let the catenoid hang, it will assume the shape of the 
helicoid in Figure 5.5. 


Problems 


5.1. Eigenvalues and eigenvectors of a symmetric matrix 

Suppose A is an n x n real symmetric matrix with complex eigenvector X and corresponding 
complex eigenvalue A. Think of X as a column vector and consider the complex number 
XTAX, where X is the complex conjugate of X and the superscript T means the transpose. 


(a) Show that XTA = 1X7. 

(b) By computing the complex number X7AX in two ways, as (XTA)X and as X' (AX), show 
that A =. This proves that the eigenvalues of a real symmetric matrix are always real. 

(c) Show that A has n independent real eigenvectors. 

(d) Show that eigenvectors of A corresponding to distinct eigenvalues are orthogonal. (Hint: 
Let X and Y be eigenvectors corresponding to distinct eigenvalues A and LL, respectively. 
Note that (X,Y) = XTY. Compute X7AY two different ways.) 


Part (d) shows that if the principal curvatures at a point of a surface in R? are distinct, then the 
principal directions are orthogonal. 


5.2. Determinant and eigenvalues 
Prove that the determinant of any n x n matrix, real or complex, is the product of its eigenval- 
ues. (Hint: Triangularize the matrix.) 


5.3. The Gauss map 
Let M be a smooth oriented surface in R?, with a smooth unit normal vector field N. The 
Gauss map of M is the map 


v:M>S9 CR’, — v(p)=Np, 


where Np is considered to be a unit vector starting at the origin. Show that the differential of 
the Gauss map at p is the negative of the Weingarten map: 


Vx, p(Xp) = —L(Xp) 


for any Xp € T,M. (Hint: Compute the differential by using curves. See [21, Section 8.7].) 
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5.4. Total curvature 

The total curvature of a smooth oriented surface M in R? is defined to be the integral [ iu K, 
if it exists, of the Gaussian curvature K. Prove that the total curvature of M is, up to sign, the 
area of the image of the Gauss map: 


i: K = Area of v(M). 
M 


5.5. Shape operator of a cylinder 
Let M be the cylinder of radius a in R? defined by x? + y* =a’. At p = (x,y,z) € M, the 


vectors 
1 ra) £ 0 a 
1 a Ox ty fo Eo 


form an orthonormal basis for the tangent space T,M. Let N be the unit normal vector field 


1/0 0 
N=-—(x—+y— ). 
a (x Ox +5) 
(a) Find the matrix of the shape operator with respect to the basis e;,e2 at p. (Compute 


L(e;) = —De,N and find the matrix [a‘] such that L(e;) = La\ei.) 
(b) Compute the mean and Gaussian curvatures of the cylinder M. 


5.6. Shape operator of a sphere 
Let M be the sphere of radius a in R? defined by 


P+y42=a. 


Parametrize the sphere using spherical coordinates: 


x = asing cos 0 
y =asingsin@ 
Z=acos@, O0<¢o<n, 0<O0<2z. 


Then for each p € M, e; = 0/0, en = 0/06 is a basis for the tangent space T,M for p € M. 
Let N, be the unit outward normal vector at p on the sphere. 


(a) Find the matrix of the shape operator of the sphere with respect to the basis e1, e2 at p. 
(b) Compute the mean and Gaussian curvatures of the sphere M at p using (a). 


5.7. Surface of revolution in R? 

Let (f(u),g(u)) be a unit-speed curve without self-intersection in the (y,z)-plane. Assume 
f(u) > 0, so that (f(u),g(u)) can be rotated about the z-axis to form a surface of revolution 
M in R?. A parametrization of the surface of revolution is 


f(u)cosv 
w(u,v) = | f(u)sinv;, O<v<2z. 
g(u) 
Assume that y is a diffeomorphism onto its image, so that u,v are coordinates on a chart U 
in M. Then e; = 0/du, er = 0/Ovis a basis for the tangent space T,(M) for p € U. 
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(a) Find the matrix of the shape operator of the surface of revolution with respect to the basis 
1,2 at p. 
(b) Compute the mean and Gaussian curvature of the surface of revolution M at p. 


5.8. The average value of normal curvature 

Let M be an oriented surface in R? with a unit normal vector field N, and let p be a point 
in M. Each unit tangent vector X, € T,M determines a normal section with curvature K(X,) 
at p. The unit tangent vectors in T,M form a circle $ ! and the normal curvature x is a function 
on this circle. Show that the average value (1/27) Js, « of the normal curvature at p is the 
mean curvature at p. (In general, the average value of a function f on the unit circle S! is 


(1/2m) Jsi f.) 


5.9. The coefficients of the two fundamental forms 
Suppose M is an oriented surface in R, p € M, and e}, 9 is a basis for TyM. Let E,F,G,e, f,g 
be the coefficients of the first and second fundamental forms of M at p relative to e,é0. 


(a) Show that the matrix of the shape operator L: T,M — TpM relative to e1, e2 is 


EF] ‘fe 
FG fal 
(b) Compute the Gaussian curvature K and the mean curvature H of M at p in terms of 


E,F,G,e,f,g. 


5.10. The first and second fundamental forms of a graph 
Let M be the graph of a C® function z = h(x,y) in R?. Then M can be parametrized by 
o(x,y) = (x, y,A(x,y)). Let e; = 6,(0/dx) and ey = 0,(0/dy). 


(a) Show that the coefficients of the first fundamental form relative to e;,e2 are 
E=1+hj,, F=hhy, G=1+h,. 


(Here hy = dh/Ox.) 
(b) Compute the coefficients of the second fundamental forms relative to e;,e9. 
(c) Show that the Gaussian curvature of the graph z = h(x, y) is 


__ Ixxhyy — hy 
(he + he +1) 


5.11. Isometry between a catenoid and a helicoid 
Let U be the open set in the catenoid with parametrization 


(x,y,z) = ((coshu) cos @,(coshu)sin@,u), —l<u<1, 0<@<2z, 
and let U’ be the open set in the helicoid with parametrization 
(x,y,z) = (u'cos @,u'sin®@,@), —sinhl <u’ <sinhl, 0<6<2z. 
Define g: U — U’ by 
9 ((coshu) cos , (coshu) sin @,w) = ((sinhu) cos 6, (sinhw) sin @,@). 


As explained in Section 5.5, @ is a diffeomorphism. 
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Viewing u, 0 as coordinate functions on U, we let e; = 0/du, e2 = 0/00, which are then 
tangent vector fields on U. 


(a) Compute e},e2 in terms of 0/dx,0/dy,0/dz. 

(b) Compute E, F,G relative to the frame e,e at p € U of the catenoid. 

(c) Let e, = @x€1,€5 = Px ep on the open set U’ of the helicoid. Compute e/,e5 in terms of 
0/dx,0/dy,0/dzat P(p) EU". 

(d) Compute E’, F’,G’ relative to the frame e},,¢5. 

(e) Prove that : U > U’ is an isometry. 
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$6 Affine Connections 


Consider a smooth vector field Y on a manifold M and a tangent vector X, € T,M 
at a point p in M. To define the directional derivative of Y in the direction Xj, it is 
necessary to compare the values of Y in a neighborhood of p. If g is a point near p, 
in general it is not possible to compare the vectors Y, and Y, by taking the difference 
Y, — Yp, since they are in distinct tangent spaces. For this reason, the directional 
derivative of a vector field on an arbitrary manifold M cannot be defined in the same 
way as in Section 4. Instead, we extract from the directional derivative in R” certain 
key properties and call any map D: X(M) x X(M) — X(M) with these properties an 
affine connection. Intuitively, an affine connection on a manifold is simply a way of 
differentiating vector fields on the manifold. 

Mimicking the directional derivative in IR”, we define the torsion and curvature 
of an affine connection on a manifold M. Miraculously, both torsion and curvature 
are linear over C™ functions in every argument. 

We will see in a later section that there are infinitely many affine connections on 
any manifold. On a Riemannian manifold, however, there is a unique torsion-free 
affine connection compatible with the metric, called the Riemannian or Levi-Civita 
connection. As an example, we describe the Riemannian connection on a surface 
in R?. 


6.1 Affine Connections 


On an arbitrary manifold M, which is not necessarily embedded in a Euclidean space, 
we can define the directional derivative of a C™ function f in the direction X, € T,M 
in the same way as before: 
Vx,f =Xpf. 

However, there is no longer a canonical way to define the directional derivative of a 
vector field Y. Formula (4.2) fails because unlike in R”, there is no canonical basis 
for the tangent space 7,M. 

Whatever definition of directional derivative of a vector field one adopts, it should 
satisfy the properties in Proposition 4.2. This motivates the following definition. 


Definition 6.1. An affine connection on a manifold M is an R-bilinear map 

V: X(M) x X(M) > X(M), 
written VyY for V(X,Y), satisfying the two properties below: if F is the ring C°(M) 
of C® functions on M, then for all X,Y € ¥(M), 


(i) VxY is F-linear in X, 
(ii) (Leibniz rule) VxY satisfies the Leibniz rule in Y: for f € F, 


Vx (FY) = (Xf)¥ + fVxY. 


Example 6.2. The directional derivative Dx of a vector field Y on R” is an affine 
connection on R”, sometimes called the Euclidean connection on R". 


44 86 Affine Connections 
6.2 Torsion and Curvature 


Given an affine connection V on a manifold M, one might ask whether it satisfies 
the same properties as Proposition 4.3 for the Euclidean connection on R”. For 
X,Y € X(M), set 
T (X,Y) =VxY —VyX — [X,Y] © X(M), 
R(X,Y) = [Vx, Vy] —Vixy) 
= VxVy VyVx Vix.y] End(X(M)). 


We call T the torsion and R the curvature of the connection. There does not seem 
to be a good reason for calling T(X,Y) the torsion, but as we shall see in Section 8, 
R(X,Y) is intimately related to the Gaussian curvature of a surface. 

An affine connection V on a manifold M gives rise to a linear map 


%(M) —> Endp(X(M)), XW Vx. 


Here both vector spaces X(M) and Endpy(X(M)) are Lie algebras. The curvature 
measures the deviation of the map X ++ Vx from being a Lie algebra homomorphism. 

Recall that F is the ring C°(M) of C® functions on the manifold M. Although 
VxY is not F-linear in Y, it turns out, amazingly, that both torsion and curvature are 
F-linear in all their arguments. 


Proposition 6.3. Let X,Y,Z be smooth vector fields on a manifold M with affine 
connection V. 


(i) The torsion T (X,Y) is ¥-linear in X and in Y. 
(ii) The curvature R(X ,Y)Z is ¥-linear in X, Y, and Z. 


Proof. We will first check the ¥-linearity of R(X,Y)Z in X. For this, it is useful to 
recall the following formula from the theory of manifolds (Problem 4.1): if f,g € 
F =C*(M), then 


[fX,8Y] = fglX,Y]+ f(Xg)¥ — g(Vf)X. (6.1) 
By the definition of curvature, 
R(fX,Y)Z = VexVyZ— Vy V pxZ — Vigx yjZ- (6.2) 


By the F-linearity of V in X, the first term in (6.2) is fVyxVyZ, and by the Leibniz 
rule, the second term is 


Vy (fVxZ) = (Yf)VxZ + fVyVxZ. 
Applying (6.1), the last term in (6.2) is 


Virxv1Z = Very pxZ 
= fVixyjZ— (VY S)VxZ, 
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since X(1) = 0. Combining the three terms gives 


R(fX,Y)Z = f(VxVyZ—VyVxZ— Vix y/Z) 
= fR(X,Y)Z. 


Because R(X,Y) is skew-symmetric in X and Y, F-linearity in Y follows from 
F-linearity in the first argument: 


R(X, fY) = —R(fY,X) =—fR(Y,X) = fR(X,Y). 


We leave the F-linearity of T(X,Y) as well as that of R(X,Y)Z in Z as exercises 
(Problems 6.2 and 6.3). 


6.3 The Riemannian Connection 


To narrow down the number of affine connections 
on a manifold, we impose additional conditions on a 
connection. On any manifold, we say that a connec- 
tion is torsion-free if its torsion is zero. On a Rieman- 
nian manifold, we say that a connection is compatible 
with the metric if for all X,Y,Z © X(M), 


Z(X,Y) = (VzX,Y) + (X,Vz¥). 


It turns out that these two additional conditions are 
enough to determine a connection uniquely on a Rie- 
mannian manifold. 


Definition 6.4. On a Riemannian manifold a Rie- 
mannian connection, sometimes called a Levi-Civita 
connection, is an affine connection that is torsion- 
free and compatible with the metric. Tullio Levi-Civita 


Lemma 6.5. A C™ vector field X on a Riemannian (1873-1941) 
manifold (M,(, )) is uniquely determined by the val- 
ues (X,Z) for all Z € X(M). 


Proof. We need to show that if X’ € ¥(M) and (X,Z) = (X’,Z) for all Z € X(M), 
then X = X’. With Y = X — X’, this is equivalent to showing that if (Y,Z) = 0 for all 
Z € X(M), then Y = 0. Take Z = Y. By the positive-definiteness of the inner product 
at each point p € M, 


(Y,Y) =0 => (¥,Y,) =0 for all pe M 
=> Y, =O forall pe M 
=> Y=0. 


Theorem 6.6. On a Riemannian manifold there is a unique Riemannian connection. 
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Proof. Suppose V is a Riemannian connection on M. By Lemma 6.5, to specify 
VxY, it suffices to know (VxY,Z) for every vector field Z € ¥(M). So we will try 
to find a formula for (VxY,Z) involving only the Riemannian metric and canonical 
operations on vector fields such as the Lie bracket. 
A Riemannian connection satisfies the two formulas 
VxY —VyX — [X,Y] =0 (6.3) 
and 
X(Y,Z) = (Vx¥,Z) + (¥,VxZ). (6.4) 


Cyclically permuting X, Y, Z in (6.4) gives two other formulas: 


Y(Z,X) = (VyZ,X) Tv (Z,VyX), (6.5) 
Z(X,Y) = (W2X,¥) + (X,Vz¥). (6.6) 


Using (6.3) we can rewrite VyX in (6.5) in terms of VxY: 


Subtracting (6.6) from (6.4) and then adding (6.7) to it will create terms involving 
VxZ—VzX and VyZ —VzY, which are equal to [X, Z] and [Y,Z] by torsion-freeness: 


X(Y,Z)+Y (Z,X) —Z(X,Y) 
=2(VyxY,Z) + (Y, VxZ—VzX) + (X,VyZ—VzY) = (Z, [X,Y]) 
=2(VyY,Z) + (Y,[X,Z]) + (X, [Y,Z]) — (Z, [X,¥]). 


Solving for (VxY,Z), we get 


2VxY,Z) = X(Y,Z) +¥(Z,X) —Z(X,Y) 


— (X,[Y,Z]) + (Y,[Z,X]) + (Z, [X,Y]). oe) 


This formula proves that a Riemannian connection, if it exists, is unique. 

Define VxY by the formula (6.8). It is a straightforward exercise to check 
that so defined, V is a torsion-free affine connection compatible with the metric 
(Problem 6.4). This proves the existence of a Riemannian connection on a Rie- 
mannian manifold. 


Example 6.7. By Proposition 4.3, the Riemannian connection on R” with its usual 
Euclidean metric is the directional derivative DyY on R”. 


6.4 Orthogonal Projection on a Surface in R? 


Suppose M is a smooth surface in R*, which we do not assume to be orientable. At 
each point p € M, let v, be the normal line through p. It is perpendicular to the 
tangent space T,,M, so there is an orthogonal decomposition 


TR? ~ TyM © Vp. 


Let pr,: T,R? — T,M be the projection to the tangent space of M at p. 
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If X is aC™ vector field along M, then X, € T,R? . Define a vector field pr(X) on 
M by 


pr(X)» = pr, (Xp) € TM. 


Proposition 6.8. /f X is a C® vector field along M in R°, then the vector field pr(X) 
on M defined above is C™. 


Proof. For each p € M, there is a neighborhood U of p in M on which there is a C™ 
unit normal vector field NV (by Section 5.1). For g € U, 


pr, (Xq) = Xq— (Xq,Nq)Nq- 


Note that pr, does not depend on the choice of the unit normal vector field; —Ng 
would have given the same answer. As q varies in U, 


pr(X) =X — (X,N)N, 


which shows that pr(X) is C” on U. Hence, pr(X) is C® at p. Since p is an arbitrary 
point of M, the vector field pr(X) is C® on M. 


According to this proposition, the projection operator is a map 


pr: [(7R3|m) > £(M). 


6.5 The Riemannian Connection on a Surface in R° 


We continue to consider a smooth, not necessarily orientable surface M in R*. Given 
two C™ vector fields X and Y on M and a point p € M, the directional derivative 
Dx,Y is not in general tangent to M. Define 


(VxY)p = Vee = pr, (Dx,Y) E T,M, (6.9) 


where pr,,: T,R? — T,M is the orthogonal projection defined in Section 6.4. As p 
varies over M, 


VxY = pr(DxY), 
which, by Proposition 6.8, shows that VxY is a C™ vector field on M. So we have a 
map 
V: X(M) x &(M) > X(M). 


Proposition 6.9. The Riemannian connection on a smooth, not necessarily ori- 
entable surface M in R? is given by VxY = pr(DxY). 


Proof. Problem 6.6. 
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Problems 


6.1. Convex linear combination of connections 

Show that if V and V’ are connections on a manifold, then their sum V + V’ is not a connection. 

However, a convex linear combination rV + (1 —1)V’ for any t € R is a connection. More 

generally, show that if V!,...,V” are connections on a manifold, then the linear combination 
;_, @jV' is a connection provided ¥?_, a; = 1. 

6.2. F-linearity of the torsion 

Prove that the torsion T(X,Y) is ¥-linear in X and in Y. 


6.3. F-linearity of the curvature 
Prove that the curvature R(X,Y)Z is F-linear in Z. 


6.4. Existence of a Riemannian connection 
Prove that formula (6.8) defines a torsion-free affine connection V that is compatible with the 
metric. 


6.5. Orthogonal projection 
Let M be a smooth surface in R*. Show that the orthogonal projection pr: '(7R?|y) — X(M) 
is F-linear. 


6.6. Riemannian connection on a surface in R? 
Prove Proposition 6.9. 


6.7. Lie derivative 
On any manifold M, let £: X(M) x X(M) > X(M), £(X,Y) = Ly, be the Lie derivative. 
Show that £yx¥ satisfies the Leibniz rule in Y, but is not F-linear in X. (Hint: LyY = [X,Y].) 


6.8. Riemannian connection of a submanifold 
Let § C M be an immersed submanifold of a Riemannian manifold M and let V be the 
Riemannian connection on M. Show that the Riemannian connection V’ on S is given by 


VAY = (VxY)tan- 
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$7 Vector Bundles 


The set X(M) of all C® vector fields on a manifold M has the structure of a real 
vector space, which is the same as a module over the field R of real numbers. Let 
F = C*(M) again be the ring of C* functions on M. Since we can multiply a vector 
field by a C® function, the vector space %(M) is also a module over F. Thus the set 
%(M) has two module structures, over R and over F. In speaking of a linear map: 
3%(M) — X(M) one should be careful to specify whether it is R-linear or ¥-linear. 
An 3-linear map is of course R-linear, but the converse is not true. 

The F-linearity of the torsion T(X,Y) and the curvature R(X ,Y)Z from the pre- 
ceding section has an important consequence, namely that these two constructions 
make sense pointwise. For example, if X,,Y,, and Z, are tangent vectors to a mani- 
fold M at p, then one can define R(X,,Y,)Z, to be (R(X,Y)Z), € T,M for any vector 
fields X, Y, and Z on M that extend X,, Y,, and Z,, respectively. While it is possible 
to explain this fact strictly within the framework of vector fields, it is most natural to 
study it in the context of vector bundles. For this reason, we make a digression on 
vector bundles in this section. 

We will try to understand F-linear maps from the point of view of vector bundles. 
The main result (Theorem 7.26) asserts the existence of a one-to-one correspondence 
between F-linear maps a: T(E) + T(F) of sections of vector bundles and bundle 
maps 9: E > F. 


7.1 Definition of a Vector Bundle 


Given an open subset U of a manifold M, one can think of U x R’ as a family 
of vector spaces R” parametrized by the points in U. A vector bundle, intuitively 
speaking, is a family of vector spaces that locally “looks” like U x R’. 


Definition 7.1. AC surjection 7: E — M is aC® vector bundle of rank r if 


(i) For every p € M, the set E, := 27! (p) is areal vector space of dimension r; 
(ii) every point p € M has an open neighborhood U such that there is a fiber- 
preserving diffeomorphism 


ou: x 1(U) >U xR" 


that restricts to a linear isomorphism E, — {p} x R’ on each fiber. 


The space E is called the total space, the space M the base space, and the space 
E, the fiber above p of the vector bundle. We often say that E is a vector bundle 
over M. A vector bundle of rank | is also called a line bundle. 

Condition (i) says that 7: E — M is a family of vector spaces, while condition 
(ii) formalizes the fact that this family is locally looks like R”. We call the open set 
U in (ii) a trivializing open subset for the vector bundle, and dy a trivialization of 
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m~'(U). A trivializing open cover for the vector bundle is an open cover {Ug} of M 
consisting of trivializing open sets Uz, together with trivializations ¢q: m~'(Ug) > 
Ug x R’. 


Example 7.2 (Product bundle). If V is a vector space of dimension r, then the pro- 
jection z: M x V > M isa vector bundle of rank r, called a product bundle. Via the 
projection z: S! x R — S!, the cylinder S! x R is a product bundle over the circle S!. 


Example 7.3 (Mobius strip). The open Mobius strip is the quotient of [0, 1] x R by 
the identification 
(0,1) -~ (Lt), 


It is a vector bundle of rank 1 over the circle (Figure 7.1). 


Fig. 7.1. A Mobius strip 


Example 7.4 (Restriction of a vector bundle). Let S be a submanifold of a manifold 
M, and 2: E+ MaC® vector bundle. Then zs: 2~!(S) — Sis also a vector bundle, 
called the restriction of E to S, written E|s := 2~!(S) (Figure 7.2). 


Fig. 7.2. A vector bundle on M restricted to S 
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Definition 7.5. Let m2: E > M andar: F + NbeC™ vector bundles. A C® bundle 
map from E to F is a pair of C™ maps (gp: E > F,9: M—> N) such that 


(i) the diagram 


commutes, 
(ii) @ restricts to a linear map @,: E, > Fo p) Of fibers for each p € M. 


Abusing language, we often call the map @: E — F alone the bundle map. 

An important special case of a bundle map occurs when F and F are vector 
bundles over the same manifold M and the base map @ is the identity map Ly. In 
this case we call the bundle map (¢: E > F,1y) a bundle map over M. 

If there is a bundle map y: F — E over M such that yop =I, andgoy=I1,, 
then @¢ is called a bundle isomorphism over M, and the vector bundles E and F are 
said to be isomorphic over M. 


Definition 7.6. A vector bundle 7: E — M is said to be trivial if it is isomorphic to 
a product bundle M x R” > M over M. 


Example 7.7 (Tangent bundle). For any manifold M, define TM to be the set of all 
tangent vectors of M: 


TM = {(p,v) |p EM, ve T,M}. 


If U is a coordinate open subset of M, then TU is bijective with the product bundle 
U x R". We give TM the topology generated by TU as U runs over all coordinate 
open subsets of M. In this way TM can be given a manifold structure so that TM 
M becomes a vector bundle. It is called the tangent bundle of M (for details, see [21, 
Section 12]). 


Example 7.8. If f: M — N is aC®™ map of manifolds, then its differential gives rise 
to a bundle map f.: TM — TN defined by 


fx(P,v) = (F(P), fp ))- 


7.2 The Vector Space of Sections 


A section of a vector bundle 7: E — M over an open set U is a function s: U > E 
such that 7 o s = lly, the identity map on U. For each p € U, the section s picks out 
one element of the fiber E,. The set of all C™ sections of E over U is denoted by 
T(U,E). If U is the manifold M, we also write '(Z) instead of [(M,E). 
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The set [(U, E) of C® sections of E over U is clearly a vector space over R. It is 
in fact a module over the ring C*(U) of C® functions, for if f is a C® function over 
U and sis aC™ section of E over U, then the definition 


(fs)(p) = f(p)s(p) € Ep, peu, 
makes fs into a C™ section of E over U. 


Example 7.9 (Sections of a product line bundle). A section s of the product bundle 
M x RM isamap s(p) = (p, f(p)). So there is a one-to-one correspondence 


{sections of M x R > M} <—> {functions f: M— R}. 


In particular, the space of C® sections of the product line bundle M x R — M may 
be identified with C°(M). 


Example 7.10 (Sections of the tangent bundle). A vector field on a manifold M as- 
signs to each point p € M a tangent vector X, € T,M. Therefore, it is precisely a 
section of the tangent bundle TM. Thus, X(M) =T(TM). 


Example 7.11 (Vector fields along a submanifold). If M is a regular submanifold of 
IR", then a C® vector field along M is precisely a section of the restriction TR”|y of 
TR" to M. This explains our earlier notation '(7R*|v) for the space of C” vector 
fields along M in R?. 


Definition 7.12. A bundle map ~: E — F over a manifold M (meaning that the 
base map is the identity 1.) induces a map on the space of sections: 
px: T(E) > TF), 
p(s) = Qos. 


This induced map @y is F-linear because if f € F, then 


(gx(Fs)) (Pp) = (@ ° (Fs) (p) = 9(F(P)s(P)) 
— f(P)e(s(p)) (because @ is R-linear on each fiber) 


= (f¢#(s)) (p). 


Our goal in the rest of this chapter is to prove that conversely, every F-linear map 
a: T(E) +T(F) comes from a bundle map 9: E > F, i.e., & = x. 


7.3 Extending a Local Section to a Global Section 


Consider the interval (—2/2, 2/2) as an open subset of the real line R. The example 
of the tangent function tan: (—2/2,7/2) — R shows that it may not be possible to 
extend the domain of a C® function f: U — R from an open subset U C M to the 
manifold M. However, given a point p € U, it is always possible to find a C™ global 
function f: M —> R that agrees with f on some neighborhood of p. More generally, 
this is also true for sections of a vector bundle. 
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P 
kK— U —>| 


Fig. 7.3. A bump function supported in U 


Proposition 7.13. Let E — M be aC®™ vector bundle, s a C® section of E over some 
open set U in M, and p a point in U. Then there exists a C~ global section § € 
I'(M, E) that agrees with s over some neighborhood of p. 


Proof. Choose a C™ bump function f on M such that f = 1 on a neighborhood W of 
p contained in U and supp f Cc U (Figure 7.3). Define 5: M — E by 


- \__ J f(qg)s(q) forq eu, 
a= 4) forg ZU. 


On U the section 5 is clearly C™ for it is the product of a C™ function f and a C™ 
section s. 

If p Z U, then p ¢ supp f. Since supp f is a closed set, there is a neighborhood V 
of p contained in its complement M \ supp f. On V the section J is identically zero. 
Hence, 5 is C™ at p. This proves that 5 is C* on M. 

On W, since f = 1, the section § agrees with s. 


7.4 Local Operators 


In this section, EF and F are C™ vector bundles over a manifold M, and F is the ring 
C”(M) of C* functions on M. 


Definition 7.14. Let E and F be vector bundles over a manifold M. An R-linear 
map a: T(E) > T(F) is a local operator if whenever a section s € T(E) vanishes 
on an open set U in M, then a(s) € I'(F) also vanishes on U. It is a point operator 
if whenever a section s € I'(E) vanishes at a point p in M, then a(s) € T(F) also 
vanishes at p. 


Example 7.15. By Example 7.9, the vector space C”(IR) of C” functions on R may 
be identified with the vector space '(R x R) of C® sections of the product line bundle 
over R. The derivative 4 

—:cC*(R) 9 C*(R 

&:C°(R) +C7(R) 

is a local operator since if f(t) =0 on U, then f(t) = 0 on U. However, d/dt is not 
a point operator. 
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Fig. 7.4. The product fs of a bump function f and a section s is zero. 


Example 7.16. Let Q*(M) denote the vector space of C® k-forms on a manifold M. 
Then the exterior derivative d: Q'(M) — Qk*!(M) is a local operator. 


Proposition 7.17. Let E and F be C* vector bundles over a manifold M, and 
F=C*(M). Ifa map a: T(E) +T(F) is F-linear, then it is a local operator. 


Proof. Suppose the section s € T(E) vanishes on the open set U. Let p € U and let 
f be a C® bump function such that f(p) = 1 and supp f C U (Figure 7.3). Then 
fs € T(E) and fs = 0 on M (Figure 7.4). So a(fs) = 0. By ¥-linearity, 


0=a(fs) = fa(s). 


Evaluating at p gives a(s)(p) =0. Since p is an arbitrary point of U, a(s) =OonU. 


Example 7.18. On a C* manifold M, a derivation D: C*(M) — C*(M) is R-linear, 
but not F-linear since by the Leibniz rule, 


D(fg) =(Df)g+fDg, for f,g EF. 
However, by Problem 7.1, D is a local operator. 


Example 7.19. Fix a C® vector field X € X(M). Then a connection V on M induces 
a map 
Vx: X(M) > X(M) 


that satisfies the Leibniz rule. By Problem 7.2, Vx is a local operator. 


7.5 Restriction of a Local Operator to an Open Subset 


A continuous global section of a vector bundle can always be restricted to an open 
subset, but in general a section over an open subset cannot be extended to a contin- 
uous global section. For example, the tangent function defined on the open interval 
(—2/2,2/2) cannot be extended to a continuous function on the real line. Nonethe- 
less, a local operator, which is defined on global sections of a vector bundle, can 
always be restricted to an open subset. 
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Theorem 7.20. Let E and F be vector bundles over a manifold M. If a: T(E) > 
I'(F) is a local operator, then for each open subset U of M there is a unique linear 
map, called the restriction of a to U, 


oy: T(U,E) >T(U,F) 
such that for any global section t € T(E), 
ou (tly) = alt)Iu. (7.1) 


Proof. Lets €T(U,E) and p € U. By Proposition 7.13, there exists a global section 
5 of E that agrees with s in some neighborhood W of p in U. We define ay (s)(p) 
using (7.1): 

ay (s)(p) = a(3) (Pp). 


Suppose § € T(E) is another global section that agrees with s in the neighborhood 
W of p. Then 5 =F in W. Since @ is a local operator, «(5) = o(5) on W. Hence, 
a(5)(p) = a(5)(p). This shows that oy (s)(p) is independent of the choice of 5, so 
Qy is well defined and unique. Fix p € U. If s © T(U,E) and 5 € T(M,E) agree on 
a neighborhood W of p, then ay(s) = a(5) on W. Hence, oy(s) is C” as a section 
of F. 

If t © T(M,E) is a global section, then it is a global extension of its restriction 
tly. Hence, 


for all p € U. This proves that ay (tly) = a(t)|v. 


Proposition 7.21. Let E and F be C™ vector bundles over a manifold M, let U be 
an open subset of M, and let J(U) =C*(U), the ring of C” functions on U. If 
a: T(E) > I(F) is ¥-linear, then the restriction ay : T(U,E) > T(U,F) is ¥(U)- 
linear. 


Proof. Lets €V(U,E) and f € F(U). Fix p € U and let § and f be global extensions 
of s and f that agree with s and f, respectively, on a neighborhood of p (Proposition 
7.13). Then 


au (fs)(p) = a(f5)(p) (definition of oy) 
= f(p)a(5)(p) — (F-linearity of a) 
= f(p)au(s)(P) 


Since p is an arbitrary point of U, 


au (fs) = fou(s), 


proving that ay is F(U)-linear. 
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A frame for a vector bundle E of rank r over an open set U is a collection of sections 
e1,.-..,e, of E over U such that at each point p € U, the elements e;(p),...,e,(p) 
form a basis for the fiber Ep. 


Proposition 7.22. A C~ vector bundle 1: E — M is trivial if and only if it has a C™ 
frame. 


Proof. Suppose E is trivial, with C™ trivialization 
¢:E>SMxR’. 


Let v1,...,v, be the standard basis for R”. Then the elements (p,v;), i = 1,...,r, 
form a basis for {p} x R” for each p € M, and so the sections of E 


e(p) =o '(p,vi), i=l 


provide a basis for E, at each point p € M. 
Conversely, suppose e),...,é, is a frame for E — M. Then every point e € F isa 
linear combination e = }\a'e;. The map 


o(e) = (x(e),a',...,a"): E>MxR’ 
is a bundle map with inverse 
y:MxR’ OE, 
w(p,a',...,a") = Sa'(p)ei(p). 


It follows from this proposition that over any trivializing open set U of a vector 
bundle F, there is always a frame. 


7.7 F-Linearity and Bundle Maps 


Throughout this subsection, E and F are C® vector bundles over a manifold M, and 
F = C”(M) is the ring of C” real-valued functions on M. We will show that an 
F¥-linear map a: T(E) — T(F) can be defined pointwise and therefore corresponds 
uniquely to a bundle map E > F. 


Lemma 7.23. An F-linear map a: T(E) + T(F) is a point operator. 


Proof. We need to show that if s € T(£) vanishes at a point p in M, then a(s) € T(F) 
also vanishes at p. Let U be an open neighborhood of p over which E is trivial. Thus, 
over U it is possible to find a frame e;,...,e,- for E. We write 


slu = Yale, a €C°(U) =F(U). 
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Because s vanishes at p, all a'(p) =0. Since @ is F-linear, it is a local operator 
(Proposition 7.17) and by Theorem 7.20 its restriction ay: T(U,E) > T(U,F) is 
defined. Then 


a(s)(p) = ay (s|u)(p) (Theorem 7.20) 
= ay (Ya'ei) (p) 
= (Ya'ay (e))(p) (ay is ¥(U)-linear (Proposition 7.21)) 


= Yai(p)au(ei)(p) = 0. 


Lemma 7.24. Let E and F be C® vector bundles over a manifold M.A _fiber- 
preserving map ~: E — F that is linear on each fiber is C™ if and only if @g takes 
C™ sections of E to C™ sections of F. 


Proof. (=) If @ is C®, then @4(s) = @ os clearly takes a C® section s of E toa C* 
section of F. 

(<) Fix p € M and let (U,x!,...,x") be a chart about p over which E and F are both 
trivial. Let e,...,e, € T(E) be a frame for E over U. Likewise, let f\,..., fin €T(F) 
be a frame for F over U. A point of E|y can be written as a unique linear combination 
Yale;. Suppose 


Poe = bi fi. 


In this expression the bi’s are C™ functions on U, because by hypothesis @ o e; = 
(x(e;) is aC” section of F. Then 


Qo (Dales) = Val bi fi 
j i,j 


One can take local coordinates on Ely to be (x!,...,x",a! 


local coordinates, 


1 nA r 1 n jz1 j p,m 
Q(x ,...,X",a,...,@)= (x eg Lath, Laer), 
j 


J 


,---,4@'). In terms of these 


which is a C® map. 


Proposition 7.25. If a: T(E) > T(F) is F-linear, then for each p € M, there is a 
unique linear map (py: Ey — F, such that for alls € T(E), 


p(s(p)) = &(s)(p). 
Proof. Given e € Ep, to define @,(e), choose any section s € T(E) such that s(p) =e 
(Problem 7.4) and define 
Pp(e) = a(s)(p) € Fp. 
This definition is independent of the choice of the section s, because if s’ is another 


section of E with s'(p) = e, then (s—s’)(p) = 0 and so by Lemma 7.23, we have 
a(s—s')(p) =0,ie., 
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Let us show that ~,: E, — Fy is linear. Suppose e),e2 € E, and a),a2 € R. 
Let 51,52 be global sections of E such that s;(p) = e;. Then (ais; + a252)(p) = 
ae, +a2e2, SO 


Pp(aje; +a2e2) = O(ay8; +4282) (p) 
= a1 0(s1)(p) +. a20(s2)(p) 
= a1 p(e1) + a2@p(e2). 


Theorem 7.26. There is a one-to-one correspondence 


{bundle maps 9: E — F} «> {§-linear maps a: T(E) >T(F)}, 


Proof. We first show surjectivity. Suppose ~@: [(£) + I(F) is F-linear. By the 
preceding proposition, for each p € M there is a linear map @,: E, — F, such that 
for any s € T(E), 


@p(s(p)) = a(s)(p). 


Define 9: E — F by 9(e) = @,(e) ife € Ep. 
For any s € T(E) and for every p € M, 


(9#(s))(P) = 9(s(p)) = a(s)(p), 


which shows that @ = @y. Since @y takes C™ sections of E to C™ sections of F, by 
Lemma 7.24 the map @: E — F is C”. Thus, @ is a bundle map. 

Next we prove the injectivity of the correspondence. Suppose 9, y: E — F are 
two bundle maps such that oy = yy: T(E) > T(F). For any e € Ep, choose a section 
s € T(E) such that s(p) = e. Then 


Hence, 9 = y. 


Corollary 7.27. An F-linear map @: X(M) > C*(M) is aC® I-form on M. 


Proof: By Proposition 7.25, one can define for each p € M, a linear map @,: T,M > 
R such that for all X € X(M), 


@)(Xp) = 0(X)(p). 


This shows that @ is a 1-form on M. 
For every C® vector field X on M, @(X) is a C® function on M. This shows that 
as a l-form, @ is C™. 
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7.8 Multilinear Maps over Smooth Functions 


By Proposition 7.25, if @: T(E) + T(F) is an F-linear map of sections of vector 
bundles over M, then at each p € M, it is possible to define a linear map @,: E, — Fp 
such that for any s € T(E), 


This can be generalized to F-multilinear maps. 
Proposition 7.28. Let E, E', F be vector bundles over a manifold M. If 
a: T(E) x T(E’) + T(F) 
is ¥-bilinear, then for each p € M there is a unique R-bilinear map 
Qp: Ep X Ey + Fp 
such that for all s € T(E) and s' € T(E’), 
®p(s(P),8'(p)) = (a(s,5'))(p). 


Since the proof is similar to that of Proposition 7.25, we leave it as an exercise. 
Of course, Proposition 7.28 generalizes to F-linear maps with k arguments. Just 
as in Corollary 7.27, we conclude that if an alternating map 


@: X(M) x ++. x X(M)(k times) > C”(M) 


is F-linear in each argument, then @ induces a k-form @ on M such that for 
X1,..-,Xe € X(M), 


Oy Ai pieces Ay) = (w(X1,...,Xx))(p). 


It is customary to write the k-form @ as @. 


Problems 


7.1. Derivations are local operators 
Let M be a smooth manifold. Show that a derivation D: C*(M) — C*(M) is a local operator. 


7.2. The Leibniz rule and local operators 
Let «: E > M be a vector bundle and let X € X(M). A map ay : T(E) — T(E) satisfies the 
Leibniz rule if for any f € C°(M) ands € T(E), 


ox (fs) = (Xf)s+ fax(s). 


Such a map Gy is of course not F-linear, but show that it is a local operator. 
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7.3. The Lie bracket and the Leibniz rule 
Let X be a smooth vector field on a smooth manifold M. Define a linear map ay : X(M) > 
X(M) by ax (Y) = [X,Y]. Show that cy satisfies the Leibniz rule. 


7.4. Section with a prescribed value 
Suppose 7: E —+ M is a C™ vector bundle. Let p € M and e € Ey. Show that E has a C™ 
section s with s(p) =e. 


7.5. Coefficients relative to a frame 
Suppose the vector bundle 7: E —> M has a C™ global frame e),...,ez. Then every point v in 
E can be written uniquely in the form }/a/(v)e;. Prove that the functions a/: E — R are C®. 


7.6. F-bilinear maps 
Prove Proposition 7.28. 
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$8 Gauss’s Theorema Egregium 


For a surface in IR? we defined its Gaussian curvature K at a point p by taking normal 
sections of the surface, finding the maximum ky and the minimum k> of the curvature 
of the normal sections, and setting K to be the product of kK; and k. So defined, the 
Gaussian curvature evidently depends on how the surface is isometrically embedded 
in R?. 

On the other hand, an abstract Riemannian manifold has a unique Riemannian 
connection. The curvature tensor R(X, Y) of the Riemannian connection is then com- 
pletely determined by the Riemannian metric and so is an intrinsic invariant of the 
Riemannian manifold, independent of any embedding. We think of a surface in R? as 
a particular isometric embedding of an abstract Riemannian manifold of dimension 
2. For example, both a plane and a cylinder are locally isometric embeddings of the 
same abstract surface, as one sees by simply bending a piece of paper. We will show 
in this chapter that the Gaussian curvature of a surface in R* is expressible in terms 
of the curvature tensor R(X,Y) and the metric. Hence, it too depends only on the 
metric, not on the particular embedding into R?. 


8.1 The Gauss and Codazzi—Mainardi Equations 


Suppose M is a regular submanifold of R”. For X a tangent vector field on M and 
Z a vector field along M in R", we defined in Section 4.5 the directional derivative 
DxZ. It is a vector field along M in R”. 

For X,Y € X(M) and Z €T(TR"|u), we verified in Proposition 4.10 the equation 


R(X,Y)Z = Dx DyZ— DyDxZ— Diy yj\Z =0. (8.1) 


Note that this is not the same curvature operator as the curvature operator of the 
directional derivative on R”. The earlier curvature operator was a map 


R: %(R") x ¥(R") x ¥(R") > X(R"); 
the current curvature operator is a map 
R: &(M) x ¥(M) x ¥(TR"|M) > X(TR"|M). 


Since the Riemannian connection V of a surface M in R? is defined in terms 
of the directional derivative D on M, it is easy to compare the curvature tensors of 
V and D. This will lead to a formula for the curvature R of V, called the Gauss 
curvature equation. 

A vector field Y € X(M) is also a vector field along M in R*. Hence, if X,Y € 
3(M), the directional derivative Dy Y makes sense. Assume that M is oriented with a 
unit normal vector field N. At any point p € M, the normal component of the vector 
Dx,Y is (Dx, Y,Np)Np, and therefore the tangent component is 
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pr(Dx,¥) = Dx, Y — (Dx, ¥,Np)Np- 
By (6.9), this is the definition of the Riemannian connection Vx, on M. Hence, 
Dx, Y =Vx,Y + (Dx,¥,Np)Np 

=Vx,¥ + (Lp(Xp),¥p)Np (by Lemma 5.2). (8.2) 

Globalizing this equation, we have, for X,Y € ¥(M), 
DyY =VxY + (L(X),Y)N, (8.3) 

which decomposes the vector field Dy Y into its tangential and normal components. 


Theorem 8.1. Let M be an oriented surface in R?, V its Riemannian connection, R 
the curvature operator of V, and L the shape operator on M. For X,Y,Z © X(M), 


(i) (Gauss curvature equation) 
R(X,Y)Z = (L(Y), Z)L(X) — (U(X), Z) L(Y); 
(ii) (Codazzi—Mainardi equation) 
VxL(Y) — VyL(X) —L([X,¥]) = 0. 
Proof. (i) Decomposing DyZ into its tangential and normal components, one has 


DyZ= (DyZ)tan 7 (DyZ) nor 
=VyZ+(L(Y),Z)N. (8.4) 


Hence, 
Dx DyZ = DxVyZ + Dx ((L(Y),Z)N) 
= VxVyZ+ (L(X), VyZ)N +X (L(Y), Z)N + (L(Y), Z)DxN. 
(by (8.4) and Leibniz rule) 
= VyVyZ— (L(X),Z)L(X) + (L(X), VyZ)N+X(L(Y),Z)N. (8.5) 
Interchanging X and Y gives 
DyD,Z = VyVxZ— (L(X),Z)L(Y) + (L(Y), VxZ)N + Y(L(X),Z)N. (8.6) 


By (8.4), 
Dy y|Z = VixyjZ+ (L([X,Y]),Z)N. (8.7) 


Subtracting (8.6) and (8.7) from (8.5) gives 
0 = Rp(X,Y)Z = R(X,Y)Z— (L(Y), Z)L(X) + (L(X),Z)L(¥) 


+ normal component. (8.8) 
The tangential component of (8.8) gives 
R(X,Y) = (L(Y), Z)L(X) — (L(X),Z) L(Y), 


which is the Gauss curvature equation. 
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(ii) The normal component of (8.8) is 
((L(X), VyZ) + X(L(Y),Z) 
— (L(Y), VxZ) — ¥(L(X),Z) — (L([X,¥]),Z))N =0. 
By the compatibility of V with the metric, this simplifies to 
(VxL(Y),Z) — (VyL(X),Z) — (L([X,¥]),Z) = 0. 


Since the equation above is true for all Z, by the nondegeneracy of the inner 
product, 


VyL(Y) — VyL(X) — L([X,Y]) =0. 


Remark 8.2. By the generalization of Proposition 7.28 to F-trilinear maps, because 
R(X, Y)Z is ¥-trilinear in X, Y, and Z, for each p € M, there is a unique R-trilinear 
map 

Ry: Tp>M x T,M x T,M — T,M 
such that Rp(Xp,Yp)Zp = (R(X,Y)Z)p. Thus, although we have stated the Gauss 
curvature equation for vector fields, it is also true for vectors X,,Y,,Zp € T,M ata 
point p. 


8.2 A Proof of the Theorema Egregium 


Suppose for every Riemannian manifold M, there is defined a function fy: M— R. 
The function fy is said to be an isometric invariant if for every isometry p: M > M 
of Riemannian manifolds, we have fu(p) = f(@(p)) for all p € M. 


Theorem 8.3 (Theorema Egregium). Let M be a surface in R? and p a point in M. 


(i) If e),é2 is an orthonormal basis for the tangent plane T,M, then the Gaussian 
curvature K, of M at p is 


Ky = (Rp(e1,€2)€2,€1)- (8.9) 
(ii) The Gaussian curvature is an isometric invariant of smooth surfaces in R?. 


Proof. (i) In Corollary 5.7 we found a formula for the Gaussian curvature K, in 
terms of the shape operator L and an orthonormal basis e}, e2 for T,,M: 


Kp = (L(e1),€1) (L(e2),€2) — (L(e1), 2) (L(e2),e1)- 
By the Gauss curvature equation, 
Rp(e1,e2)e2 = (L(e2), e2)L(e1) — (L(e1), e2)L(e2). 
Taking the inner product with e; gives 
(Rp(e1,€2)e2,e1) = (L(e2), 2) (L(e1),e1) — (L(e1), €2) (L(e2),€1) 
= Ky. 


(ii) Since R,(e1,e2) is determined completely by the metric, by Formula (8.9) the 
same can be said of K,. A detailed proof is left to Problem 8.3. 
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Gauss’s Theorema Egregium has some practical implications. For example, since 


a sphere and a plane have different Gaussian curvatures, it is not possible to map any 
open subset of the sphere isometrically to an open subset of the plane. Since the 
earth is spherical, this proves the impossibility of making a truly accurate map of 
any region of the earth, no matter how small. 


8.3 The Gaussian Curvature in Terms of an Arbitrary Basis 


The Theorema Egregium gives a formula for the Gaussian curvature of a surface in 
terms of an orthonormal basis for the tangent plane at a point. From it, one can derive 
a formula for the Gaussian curvature in terms of an arbitrary basis. 


Proposition 8.4. Let M be a smooth surface in R? and p € M. If u,v is any basis for 
the tangent plane T,,M, then the Gaussian curvature at p is 

(Rp(u, v)y, u) 
(u, u) (y, v) _ (u, vy? 


Proof. Let e,,e2 be an orthonormal basis for T,M and suppose 


Kp = 


u= ae, +beo, 


v= cei + de. 
By the skew-symmetry of the curvature tensor, 
R,(u,v) = (ad — bce) Rp(e1,€2), 

so that 

(Rp(u, v)v,u) = (ad — be)? (Rp(e1,€2)e2, 1). 
On the other hand, 

(u, u) (y, v) _ (u, vy? 

= (a’ +b’) (c? +d’) — (ac+ bd) = (ad — be)’. 

Hence, 
(Rp(u,v)v, u) 

(u, u) (y, v) = (u, v)? , 


Kp = (Rp(e1,€2)€2,€1) = 


Problems 

8.1. Affine connection under a diffeomorphism 

A diffeomorphism @: M — M of smooth manifolds induces an isomorphism 
Q..: ¥(M) > X(M) 


of their Lie algebras of vector fields. (In particular, @,([X,¥Y]) = [@.X,@.Y].) Suppose M has 
an affine connection V. For X,Y € X(M), define the vector field Vy¥ € %(M) by 


Q(VxY) = Vo.x (QxY). 
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(a) Show that for all f € C*(M), 9.(fX) = (fo @!)@,X. 
(b) Show that V: ¥(M) x X(M) > X(M) is an affine connection on M. 


8.2. Riemannian connection and curvature under an isometry 


(a) Now suppose that @: M — M is an isometry of Riemannian manifolds. Show that if V 
is the Riemannian connection on M, then the connection defined in Problem 8.1 is the 
Riemannian connection on M. 

(b) Let R and R be the curvature tensors of V and V, respectively. Show that for all X,Y, 
ZEX(M), 

@x(R(X,Y)Z) = R(@X, PY) QZ. 


8.3. Gaussian curvature is an isometric invariant 
Suppose @: M — M is an isometry of smooth surfaces in R?. Show that the Gaussian curva- 
ture of M at a point p € M is equal to the Gaussian curvature K of M at @(p): Ky = Kate: 
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§9 Generalizations to Hypersurfaces in R"t! 


Much of what we have done for surfaces in R° in the preceding sections generalizes 
readily to higher dimensions. We now carry this out. 


9.1 The Shape Operator of a Hypersurface 


A hypersurface M in R"*! is a submanifold of codimension 1. Although all of our 
hypersurfaces are by definition smooth, sometimes for emphasis we adopt the red- 
undant locution “smooth hypersurface.” Assume that there is a smooth unit normal 
vector field N on M; note that this is always possible locally on any hypersurface. By 
Proposition 4.3 we know that the directional derivative D on R’*! is the Riemannian 
connection of R’*!, 

For any point p € M and tangent vector X, € T,M, since (N,N) = 1, 


0 = X,(N,N) = 2(Dx,N,N). 


Therefore, Dy,N is tangent to M. The shape operator L,: T,M — T,M is defined 
to be 
Ly (Xp) = —Dx,N for Xp € T,M. 

Recall that *(M) is the vector space of C* vector fields and F = C”(M) the ring 
of C® functions on M. As the point p varies over M, the shape operator globalizes to 
an F-linear map L: X(M) + X(M) given by L(X), = Lp(Xp). 

Proposition 9.1. Let X,Y € X(M) be C® vector fields on an oriented hypersurface 
M inR"*!. Then 

(i) (L(X),¥) = (Dx¥,). 

(ii) the shape operator is self-adjoint with respect to the Euclidean metric inherited 

from R"*!; 

(L(X),Y) = (X, L(Y). 


Proof. Since (Y,N) = 0, by the compatibility of D with the metric, 
0 =X(Y,N) = (DyY,N) + (Y,DyN). 


Hence, 
(DxY,N) = (Y,—DxN) = (Y,L(X)). (9.1) 
Reversing the roles of X and Y gives 
Since [X,Y] is tangent to M, 
(X,Y],N) =0. (9.3) 


By torsion-freeness, subtracting (9.2) and (9.3) from (9.1) gives 


0 = (DyY — DyX — [X,Y],N) = (Y,L(X)) — (X,L(¥)). 
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Since the shape operator L,: T,M — T,M is self-adjoint, all of its eigenvalues 


A1,.--;An are real. These are the principal curvatures of the hypersurface M at p. 
We define the mean curvature of M at p to be the average of the principal curvatures: 
a, n 
-S Ai =- * tL 
on 


i=1 


and the Gaussian curvature at p to be the determinant of the shape operator: 
n 
p) =[]4 =det(L,). 
i=l 


9.2 The Riemannian Connection of a Hypersurface 


We learned earlier that the unique Riemannian connection of R’*! is the directional 
derivative D. 


Theorem 9.2. Let M be a smooth hypersurface in R"*! and D the directional deriva- 
tive on R"*!. For X,Y € X(M), the tangential component of DxY defines the 
Riemannian connection V of M: 


VxY = (DyxY )tan- 
Proof. Since it is evident that V satisfies the two defining properties of a connection, 
it suffices to check that VxY is torsion-free and compatible with the metric. 
1) Torsion-freeness: Let Tp and Ty be the torsions of D and V, respectively. By 
definition, for X,Y € X(M), 
DxY = VxY =F (DxY)nor; (9.4) 
DyX =VyX + (DyX)nor- (9.5) 
Since D is torsion-free, 
DyY — DyX = [X,Y]. 
Equating the normal components of both sides, we get (Dx Y )nor — (DyX ) nor = 0. 
Therefore, by (9.4) and (9.5), 
0 = Tp(X,Y) = DyY — DyX — [X,Y] 
= (VxY VyX [X, Y]) (DxY ) nor as (DyX ) nor 
= VyY¥ —VyX — [X,Y] =W(X,Y). 


2) Compatibility with the metric: For X,Y,Z € X(M), 
X(Y,Z) = (DxY,Z) + (¥,DxZ) 
= (VxY + (DxY )nor,Z) + (Y, VxZ+ (Vx )nor) 
= (VxY,Z) + (Y, VxZ). 


Therefore, VyY is the Riemannian connection on E. 
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9.3 The Second Fundamental Form 


At each point p of an oriented hypersurface M C R"*!, there is a sequence of sym- 
metric bilinear maps on 7,,M, called the first, second, and third fundamental forms, 
and so on: 


I(Xp,¥p) = (Xp,Yp), 
I(Xp,¥p) = (L(Xp), Yp) = (Xp,L(¥p)), 
T(Xp,¥p) = (L? (Xp), ¥p) = (L(Xp), L(¥p)) = (Xp L? (Xp), --- 


For X,Y € X(M), the directional derivative DyY decomposes into the sum of a 
tangential component and a normal component. The tangential component (DyY)tan 
is the Riemannian connection on M; the normal component (Dx )nor is essentially 
the second fundamental form. 


Proposition 9.3. [f N is a smooth unit normal vector field on the hypersurface M in 
R"*! and X,Y € X(M) are smooth vector fields on M, then 


(DxY¥ nor = U(X, Y)N 
Proof. The normal component (DyY )no; is a multiple of N, so 


DxY = (DxY )tan + (DxY ) nor 
=VxY+AN 


for some A € R. Taking the inner product of both sides with N gives 
(DyY,N) =A(N,N) =A. 
Hence, 


(DxY ) nor = —_ (DxY, N)N 


= (L(X),Y)N 
= II(X,Y)N. 


9.4 The Gauss Curvature and Codazzi-Mainardi Equations 


Let M be an oriented, smooth hypersurface in R’*!, with a smooth unit normal vector 
field N. The relation between the Riemannian connection D on Rt! and the Rie- 
mannian connection on M implies a relation between the curvature Rp(X,Y)Z =0 
on R"*! and the curvature R(X,¥)Z on M. The tangential component of this relation 
gives the Gauss curvature equation, and the normal component gives the Codazzi- 
Mainardi equation. 


Theorem 9.4. /f M is an oriented, smooth hypersurface in R"*! and X ,Y,Z € X(M), 
then 
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(i) (the Gauss curvature equation) 

R(X,Y) = (L(Y),Z)L(X) — (L(X),Z) L(Y). 
(ii) (the Codazzi—Mainardi equation) 


VyL(Y) — VyL(X) —L([X,¥]) =0. 


Proof. The proof is identical to that of Theorem 8.1. 


Corollary 9.5. Jf M is a smooth oriented hypersurface in R"*! with curvature R and 
X,Y € X(M), then 


(R(X,Y)Y,X) =II(X,X)I(Y,Y) — (X,Y). 
Proof. By the Gauss curvature equation, 


(R(X,Y)Y,X) = (L(Y), ¥)(L(X),X) — (L(X),¥) (L(Y),X) 
= II(X,X)I(Y,Y) —I(x,Y)’. 


When M is a surface in R? and X, and Y, form an orthonormal basis for 7,M, 
the quantity on the right of the corollary is the Gaussian curvature of the surface at p 
(see Corollary 5.7(ii) and Theorem 8.3(i)). 


Chapter 2 


Curvature and Differential Forms 


In Chapter 1 we developed, in terms of vector fields, the classical theory of curvature 
for curves and surfaces in R*. There is a dual approach using differential forms. 
Differential forms arise naturally even if one is interested only in vector fields. For 
example, the coefficients of tangent vectors relative to a frame on an open set are 
differential 1-forms on the open set. Differential forms are more supple than vector 
fields: they can be differentiated and multiplied, and they behave functorially under 
the pullback by a smooth map. In the 1920s and 30s Elie Cartan pioneered the use 
of differential forms in differential geometry [4], and these have proven to be tools 
of great power and versatility. 

In this chapter, we redevelop the theory of connections and curvature in terms 
of differential forms. First, in Section 10 we generalize the notion of a connection 
from the tangent bundle to an arbitrary vector bundle. Then in Section 11 we show 
how to represent connections, curvature, and torsion by differential forms. Finally, 
in a demonstration of their utility, in Section 12 we use differential forms to reprove 
Gauss’s Theorema Egregium. 


$10 Connections on a Vector Bundle 


The affine connection generalizes the directional derivative from R” to an arbitrary 
manifold, but does not include the case of the directional derivative of a vector field 
along a submanifold of IR”. For that, we need the notion of a connection on a vector 
bundle. An affine connection on a manifold is simply a connection on the tangent 
bundle. Because of the asymmetry of the two arguments in a connection Vys on a 
vector bundle, the torsion is no longer defined, but the curvature still makes sense. 
We define a Riemannian metric on a vector bundle, so that a Riemannian metric 
on a manifold becomes a Riemannian metric on the tangent bundle. Compatibility 
of a connection with the metric again makes sense. However, the lack of a concept 
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of torsion means that it is no longer possible to single out a unique connection on a 
vector bundle such as the Riemannian connection on a Riemannian manifold. 

A connection on a vector bundle turns out to be a local operator, and like all local 
operators, it restricts to any open subset (Theorem 7.20). 


10.1 Connections on a Vector Bundle 


Definition 10.1. Let E > M be aC™® vector bundle over M. A connection on E isa 
map 
V: X(M) x T(E) > T(E) 


such that for X € X(M) ands € T(E), 


(i) Vxs is ¥-linear in X and R-linear in s; 
(ii) (Leibniz rule) if f is a C™ function on M, then 


Vx (fs) = (Xf)s+ fVxs. 
Since X f = (df)(X), the Leibniz rule may be written as 
Vx (fs) = (df)(X)s + fVxs, 


or, suppressing X, 

Vi fs) =df-st+fVs. 
Example. An affine connection on a manifold M is a connection on the tangent 
bundle TM — M. 


Example 10.2. Let M be a submanifold of R” and E = TR"|y the restriction of the 
tangent bundle of R” to M. In Section 4.5, we defined the directional derivative 


D: ¥(M) x T(TR"|y) 3 T(TR"|y). 
By Proposition 4.9, it is a connection on the vector bundle TR"|y. 
We say that a section s € T(E) is flat if Vxs =0 for all X € X(M). 


Example 10.3 (Induced connection on a trivial bundle). Let E be a trivial bundle 
of rank r over a manifold M. Thus, there is a bundle isomorphism ¢: E > M x R’, 


called a trivialization for E over M. The trivialization @: E  M x R" induces 


a connection on E as follows. If v,,...,v, is a basis for R’, then s;: p++ (p,v;), 
i= 1,...,r, define a global frame for the product bundle M x R” over M, and 
e; = !os;,i=1,...,r, define a global frame for E over M: 


g-! 
<= MxR’ 
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So every section s € [(£) can be written uniquely as a linear combination 
s= Yilei, hic F. 


We can define a connection V on E by declaring the sections e; to be flat and 
applying the Leibniz rule and R-linearity to define Vxs: 


Vxs=Vx( Shei) => (Xhie. (10.1) 
Exercise 10.4. Check that (10.1) defines a connection on the trivial bundle E. 


The connection V on a trivial bundle induced by a trivialization depends on the 
trivialization, for the flat sections for V are precisely the sections of E corresponding 
to the constant sections of M x R’ under the trivialization. 


10.2 Existence of a Connection on a Vector Bundle 


In Section 10.1 we defined a connection on a vector bundle and exhibited a con- 
nection on a trivial bundle. We will now show the existence of a connection on an 
arbitrary vector bundle. 

Let V° and V! be two connections on a vector bundle E over M. By the Leibniz 
rule, for any vector field X € X¥(M), section s € T(E), and function f € C”(M), 


Vy (fs) = (Xf)s+ fVys, (10.2) 
Vx (fs) = (Xf)s+ fVxs. (10.3) 

Hence, 
(Vo +. VL) (fs) =2(Xf)st+f(VE +Vb)s. (10.4) 


Because of the extra factor of 2 in (10.4) the sum of two connections does not satisfy 
the Leibniz rule and so is not a connection. However, if we multiply (10.2) by 1 —t 
and (10.3) by f, then (1 — t)Vo. +1tV} satisfies the Leibniz rule. More generally, the 
same idea can be used to prove the following proposition. 


Proposition 10.5. Any finite linear combination ¥ t;\V' of connections V' is a connec- 
tion provided the coefficients add up to 1, that is, }\t; = 1. 


A finite linear combination whose coefficients add up to | is called a convex 
linear combination. Using Proposition 10.5, we now prove the existence of a 
connection. 


Theorem 10.6. Every C~ vector bundle E over a manifold M has a connection. 


Proof. Fix a trivializing open cover {U,,} for E and a partition of unity {Po} subor- 
dinate to {Ug}. On each Ug, the vector bundle E|y, is trivial and so has a connection 
V% by Example 10.3. For X € X(M) and s € T(E), denote by sq the restriction of s 


to Ug, and define ' 
Vxs= > pal Zsa. 
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Although this looks like an infinite sum, it is in fact a finite sum in a neighbor- 
hood of each point p in M, for by the local finiteness of {supp Pq}, there is an open 
neighborhood U of p that intersects only finitely many of the sets supppq. This 
means that on U all except finitely many of the P,’s are zero and so Y paV¥Sq is a 
finite sum on U with ¥ pq = 1. 

By checking on the open set U, it is easy to show that Vxs is F-linear in X, is 
R-linear in s, and satisfies the Leibniz rule. Hence, V is a connection on E. 


Note the similarity of this proof to the proof for the existence of a Riemannian 
metric on a vector bundle (Theorem 10.8). For a linear combination of connections 
to be a connection, we require the coefficients to sum to 1. On the other hand, for 
a linear combination of Riemannian metrics to be a Riemannian metric, the require- 
ment is that all the coefficients be nonnegative with at least one coefficient positive. 


10.3 Curvature of a Connection on a Vector Bundle 


The concept of torsion does not make sense for a connection on an arbitrary vector 
bundle, but curvature still does. It is defined by the same formula as for an affine 
connection: for X,Y € X(M) and s € T(E), 


R(X,Y)s = VxVys Vy Vxs Vixy]5 T(E). 
So R is an R-multilinear map 
X(M) x ¥(M) x T(E) > T(E). 


As before, R(X,Y)s is F-linear in all three arguments and so it is actually defined 
pointwise. Moreover, because Rp(Xp,Y,) is skew-symmetric in X, and Y,, at every 
point p there is an alternating bilinear map 


Rp: TpyM x T,M — Hom(E,, Ep) =: End(Ep) 


into the endomorphism ring of E,. We call this map the curvature tensor of the 
connection V. 


10.4 Riemannian Bundles 


We can also generalize the notion of a Riemannian metric to vector bundles. Let 
E — M be a C™ vector bundle over a manifold M. A Riemannian metric on E 
assigns to each p € M an inner product ( , ), on the fiber E,; the assignment is 
required to be C™ in the following sense: if s and t are C™ sections of E, then (s,t) is 
a C™ function on M. 

Thus, a Riemannian metric on a manifold M is simply a Riemannian metric on 
the tangent bundle 7M. A vector bundle together with a Riemannian metric is called 
a Riemannian bundle. 
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Example 10.7. Let E be a trivial vector bundle of rank 7 over a manifold M, with 
trivialization ¢: E ~ M xR’. The Euclidean inner product ( , )r on R” induces 
a Riemannian metric on E via the trivialization @: if u,v € E,, then the fiber map 
op: Ep — R’ is a linear isomorphism and we define 


(u, v) = (bp(u), bp(v))Rr. 
It is easy to check that ( , ) is a Riemannian metric on E. 


The proof of Theorem 1.12 generalizes to prove the existence of a Riemannian 
metric on a vector bundle. 


Theorem 10.8. On any C® vector bundle m: E — M, there is a Riemannian metric. 


Proof. Let {Ua, Qa: Eluy ~ Ua x R*} be a trivializing open cover for E. On Ely, 
there is a Riemannian metric ( , )g induced from the trivial bundle Ug x R*. Let 
{Pa} be a partition of unity on M subordinate to the open cover {Uq}. By the same 
reasoning as in Theorem 1.12, the sum 


(a= Spek xe 


is a finite sum in a neighborhood of each point of M and is a Riemannian metric 
on E. 


10.5 Metric Connections 


We say that a connection V on a Riemannian bundle E is compatible with the metric 
if for all X € X(M) and s,t € T(E), 


X (s,t) = (Vyxs,t) + (s,Vxt). 


A connection compatible with the metric on a Riemannian bundle is also called a 
metric connection. 


Example. Let E be a trivial vector bundle of rank r over a manifold M, with trivial- 
ization @: E ~ M x R’. We showed in Example 10.3 that the trivialization induces a 
connection V on E and in Example 10.7 that the trivialization induces a Riemannian 
metric (, ) on E. 


Proposition 10.9. On a trivial vector bundle E over a manifold M with trivialization 
g: E + M xR’, the connection V on E induced by the trivialization @ is compatible 
with the Riemannian metric ( , ) on E induced by the trivialization. 


Proof. Let v1,...,v, be an orthonormal basis for R’ and e;,...,e, the corresponding 
global frame for E, where e;(p) = @~!(p,v;). Then e1,...,@, is an orthonormal flat 
frame for E with respect to the Riemannian metric and the connection on EF induced 
by @. If s = Ya'e; and t = Ybie; are C™ sections of FE, then 


X(s,t) =X ey a'b') (because e1,...,é, are orthonormal) 
= \i(Xa')b' + Ya‘ xp! 
= (Vys,t)+(s,Vxt) (see (10.1)). 
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Example 10.10 (Connection on TIR"|y). If M is a submanifold of IR”, the Euclidean 
metric on R” restricts to a Riemannian metric on the vector bundle TR”|y. Sections 
of the vector bundle TIR"|,y are vector fields along M in R”. As noted in Example 
10.2, the directional derivative in R” induces a connection 


D: ¥(M) x T(TR"|y) > P(TR"|y). 


Proposition 4.10 asserts that the connection D on TR"|y has zero curvature and is 
compatible with the metric. 

The Gauss curvature equation for a surface M in R?, a key ingredient of the proof 
of Gauss’s Theorema Egregium, is a consequence of the vanishing of the curvature 
tensor of the connection D on the bundle TR? |v (Theorem 8.1). 


Lemma 10.11. Let E — M be a Riemannian bundle. Suppose V',...,V'™ are con- 
nections on E compatible with the metric and ay,...,d, are C™ functions on M that 
add up to 1. Then V = ¥a;V' is a connection on E compatible with the metric. 


Proof. By Proposition 10.5, V is a connection on E. It remains to check that V is 
compatible with the metric. If X € X(M) and s,t € T(E), then 


X(s,t) = (Vies,t) + (s, Vit) (10.5) 


for all i because V! is compatible with the metric. Now multiply (10.5) by a; and 
sum: 


X(s,t) = > aiX (s,t) 
= (YaiVxs,t) + (s,¥ aiVxt) 


= (Vxs,t) + (s, Vxt). 


Proposition 10.12. On any Riemannian bundle E — M, there is a connection com- 
patible with the metric. 


Proof. Choose a trivializing open cover {Uq} for E. By Proposition 10.9, for each 
trivializing open set Ug of the cover, we can find a connection V% on E|y,, compatible 
with the metric. Let {pq} be a partition of unity subordinate to the open cover {Ug}. 
Because the collection {suppP«} is locally finite, the sum V = ).pqV® is a finite 
sum in a neighborhood of each point. Since }' py = 1, by Lemma 10.11, V is a 
connection on E compatible with the metric. 


10.6 Restricting a Connection to an Open Subset 
A connection V on a vector bundle E over M, 
V: X(M) x T(E) > T(E) 


is F-linear in the first argument, but not F-linear in the second argument. However, it 
turns out that the F-linearity in the first argument and the Leibniz rule in the second 
argument are enough to imply that V is a local operator. 
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Proposition 10.13. Let V be a connection on a vector bundle E over a manifold M, 
X be a smooth vector field on M, and s a smooth section of E. If either X or s 
vanishes identically on an open subset U, then Vxs vanishes identically on U. (see 
Problem 7.2.) 


Proof. Suppose X vanishes identically on U. Since Vxs is F-linear in X, by Propo- 
sition 7.17, for any s € T(E), Vxs vanishes identically on U. 

Next, suppose s = 0 on U and p € U. Choose a C™ bump function f such that 
f = 1 in a neighborhood of p and supp f C U. By our choice of f, the derivative 
X,f is zero. Since s vanishes on the support of f, we have fs = 0 on M and so 
Vx (fs) = 0. Evaluating at p gives 


0= (Vx(f3))p = (XS) p5p + f(P)(Vx5)p = (Vx5)p- 


Because p is an arbitrary point of U, we conclude that Vxs vanishes identically on U. 


In the same way that a local operator: T(E) + T(F) can be restricted to any open 
subset (Theorem 7.20), a connection on a vector bundle can be restricted to any open 
subset. More precisely, given a connection V on a vector bundle EF, for every open 
set U there is a connection 


VW’: ¥(U) x T(U,E) + T(U,E) 
such that for any global vector field X € X(M) and global section 5 € T(E), 
V¥iy Slu) = (Vxs)lu- 


Suppose X € X(U) and s € T(U,E). For any p € U, to define V¥s € T(U,E) 
first pick a global vector field X and a global section § € T'(£) that agree with X and 
s in a neighborhood of p. Then define 


(V¥s)(p) = (Vz5)(p). (10.6) 


Because V5 is a local operator in X and in 5, this definition is independent of the 
choice of X and §. It is a routine matter to show that V” satisfies all the properties of 
a connection on E|y (Problem 10.1). 


10.7 Connections at a Point 


Suppose V is a connection on a vector bundle E over a manifold M. For X € ¥(M) 
and s € T(E), since Vys is ¥-linear in X, it is a point operator in X and Proposi- 
tion 7.25 assures us that it can be defined pointwise in X: there is a unique map, also 
denoted by V, 

V: 1,MxIT(E) > E, 
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such that if X € X(M) and s € T(E), then 


Vs = (Vx5)p. 
It is easy to check that Vx,s has the following properties: for X, € T,M and 
sET(E£), 


(i) Vx,5 is R-linear in Xp and in s; 
(ii) if f is a C™ function on M, then 


Vx, (fs) = (Xpf)s(p) + f(P)Vx,5- 
Problems 


10.1. Restriction of a connection to an open subset 
Show that the restriction of a connection to an open subset U given by (10.6) defines a con- 
nection on Ely. 


10.2. Restriction of a Riemannian connection 
Let U be an open subset of a Riemannian manifold M. Prove that if V is the Riemannian 
connection on M, then the restriction VY is the Riemannian connection on U. 


10.3. Agreement on a curve 

Given a connection V on a vector bundle E — M, a point p in M, and a tangent vector Xp in 
TM, show that if two sections s and t of E agree on a curve through p in M with initial vector 
Xp, then Vx, 8 = Vx,t- 
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$11 Connection, Curvature, and Torsion Forms 


According to Gauss’s Theorema Egregium, if R, is the curvature tensor at a point p 
of a surface M in R? and u,v is any orthonormal frame for the tangent plane T,M, 
then the Gaussian curvature of the surface at p is 


Ky = (Rp(u,v)v, U). (11.1) 


Since this formula does not depend on the embedding of the surface in R?, but only 
on the Riemannian structure of the surface, it makes sense for an abstract Riemannian 
2-manifold and can be taken as the definition of the Gaussian curvature at a point of 
such a surface, for example, the hyperbolic upper half-plane HI* (Problem 1.7). To 
compute the Gaussian curvature from (11.1), one would need to compute first the 
Riemannian connection V using the six-term formula (6.8) and then compute the 
curvature tensor R,(u,v)v—a clearly nontrivial task. 

One of the great advantages of differential forms is its computability, and so in 
this section we shall recast connections, curvature, and torsion in terms of differen- 
tial forms. This will lead to a simple computation of the Gaussian curvature of the 
hyperbolic upper half-plane in the next section. 

Relative to a frame for a vector bundle, a connection on the bundle can be repre- 
sented by a matrix of 1-forms, and the curvature by a matrix of 2-forms. The relation 
between these two matrices is the structural equation of the connection. 

The Gram—Schmidt process in linear algebra turns any C™ frame of a Riemannian 
bundle into an orthonormal frame. Relative to an orthonormal frame, the connection 
matrix of a metric connection is skew-symmetric. By the structural equation, the cur- 
vature matrix is also skew-symmetric. The skew-symmetry of the curvature matrix 
of a metric connection will have important consequences in a later chapter. 

On a tangent bundle, the torsion of a connection can also be represented by a 
vector of 2-forms called the torsion forms. There is a structural equation relating the 
torsion forms to the dual forms and the connection forms. 


11.1 Connection and Curvature Forms 


Let V be a connection on a C™ rank r vector bundle 7: E — M. We are interested 
in describing V locally. Section 10.6 shows how on every open subset U of M, V 
restricts to a connection on E|y > U: 


VW!» X(0) x TU, £) oT (U,£). 


We will usually omit the superscript U and write VY as V. 

Suppose U is a trivializing open set for E and e),...,e, is a frame for E over 
U (Proposition 7.22), and let X € X(U) be a C® vector field on U. On U, since 
any section s € '(U,£) is a linear combination s = Yale;, the section Vxs can be 
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computed from Vye; by linearity and the Leibniz rule. As a section of E over U, 
Vxe; is a linear combination of the e;’s with coefficients « depending on X: 


Vxej = > 05 (X ej. 
The J-linearity of Vye; in X implies that OF is F-linear in X and so OF is a 1-form 
on U (Corollary 7.27). The 1-forms OF on U are called the connection forms, and 
the matrix @ = [@!] is called the connection matrix, of the connection V relative to 


J 
the frame e),...,e, on U. 


Similarly, for X,Y € X(U), the section R(X,Y)e; is a linear combination of 
€],...,€r: ; 
R(X, Ye; = YQi(X,V)e;. 
Since 
R(X, Y) =VxVy —VyVx — Vix.y] 
is alternating and is F-bilinear, so is Qi. By Section 7.8, Qi is a 2-form on U. The 
2-forms Qi are called the curvature forms, and the matrix Q = [Q'] is called the 
curvature matrix, of the connection V relative to the frame e;,...,e, on U. 
Recall that if @ and B are C® 1-forms and X and Y are C® vector fields on a 
manifold, then 
(@AB)(X,Y) = a(X)B(Y) — a(¥)B(X) (11.2) 
and 
(da)(X,Y) =Xa(Y)—Ya(x) —a([x,Y]) (11.3) 
((21, Section 3.7] and [21, Prop. 20.13]). 


Theorem 11.1. Let V be a connection on a vector bundle E — M of rank r. Relative 
to a frame e,...,e, for E over a trivializing open set U, the curvature forms Qi are 


related to the connection forms OF by the second structural equation: 
i _ ai ERC ASk 
= aO;t DANG: 


Remark 11.2 (The Einstein summation convention). In classical differential geome- 
try, itis customary to omit the summation sign }} whenever there is a pair of repeating 
indices, one a superscript and the other a subscript. This is called the Einstein sum- 
mation convention. For example, in the Einstein summation convention, aie; means 
¥; a@e;. We will sometimes adopt this convention if it simplifies the appearance of a 
proof without creating confusion. 


Proof (of Theorem 11.1). Let X and Y be smooth vector fields on U. Then 


VxVye; = Vx ¥ (a) (Vex) (definition of connection forms) 
k 
=> Xo} (Yer+ Yo (Y)Vxex (Leibniz rule) 
k k 


= YX o;(¥ ei + iO ee 
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Interchanging X and Y gives 


VyVxej = VY ai(X)ei+ ¥ w} (X) aj (Ve. 
i i,k 


Furthermore, 
Vixyjei= » a} ([X,¥])ei. 
Hence, in Einstein notation, 
R(X,Y)e; => VxVye; = Vy Vxe; — Vixy]ei 
= (X@i(Y) —Yoi(X) — 0} ([X,Y]))ei 
+ (@j(X) oF (Y) — @(Y)o§(X))e; 
= doi (X, Y)e;+ o}. x oo} (X,Y ej (by (11.3) and (11.2)) 
= (dai + @ A @})(X,Y)e;. 


Comparing this with the definition of the curvature form Qi gives 


ae a 
Dat DONG 


11.2 Connections on a Framed Open Set 


Suppose E is a C™ vector bundle over a manifold M and U is an open set on which 
there is a C™ frame e1,...,e, for E. We call U a framed open set for E for short. 
A connection V on E|y determines a unique connection matrix [oi] relative to the 


frame e;,...,é-. Conversely, any matrix of 1-forms [a5] on U determines a connec- 
tion on Ely as follows. 
Given a matrix [@/] of 1-forms on U, and X,Y € X(U), we set 


Vxej = > oi (X ei, 
and define VxY by applying the Leibniz rule to Y = She jl 


VxY = Vx (hie;) = (Xh/)ej +h! oi (Xe; 
= ((Xh') +h!) (X))e. (11.4) 


With this definition, V is a connection on E|y (Problem 11.2). 


11.3 The Gram-Schmidt Process 


The Gram-—Schmidt process in linear algebra turns any linearly independent set of 
vectors Vj,...,V, in an inner product space V into an orthonormal set with the same 
span. Denote by proj, b the orthogonal projection of b to the linear span of a. Then 
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(ba) 
a 
(a, a) 
To carry out the Gram—Schmidt process, we first create an orthogonal set 
W1,-+-,>Wn: 


proj a b = 


wi=Vi, 
W2 = v2 — proj, v2 
(v2,W1) 
(wii) 
W3 = V3— proj, V3 — proj, V3 (11.5) 
V¥3,W] V3,W2 
Hyp Py, — PE, 


and so on. 


Proposition 11.3. In the Gram—Schmidt process (11.5), for each k, the set w1,...,wk 


span the same linear subspace of V as V1,...,Vx. 

Proof. Let (w,,...,wg) be the linear subspace of V spanned by w),...,wg. From 
(11.5), it is clear that each v; is a linear combination of w,,...,w;. Hence, we have 
(Vi 62-5 Ve) C (Wy ++) Wk) 


We prove the reverse inequality by induction. The base case (w1) C (v1) is triv- 
ially true. Suppose (w1,...,wx_1) C (v1,---,V¥e-1). Then (11.5) shows that 


k-1 (v ‘ 
k wi) 
w=V%- > . w; © (V1,---5VE)- 
it (wi, Wi) 
Hence, (w1,..-,Wx) C (V1,---,VE)- 

Since v,...,vg are linearly independent, the subspace (v1,...,v,) has dimen- 
sion k. Thus, w1,...,w, is a basis for (v1,...,vg) = (W1,.--,w x). In particular, none 
of the vectors w1,...,w, is 0. It is easy to check that w),...,w, is an orthogonal set. 
To get an orthonormal set e),...,é@n, define 

Wi 
Ga 5, (11.6) 
I] will 

The Gram—Schmidt process can also be applied to a frame v),...,v, for a vector 
bundle over an open set U. We see from (11.5) and (11.6) that it is a C® process: if 
the sections v1,...,V, of E are C® over U, so are the orthonormal sections e1,..., @p. 


11.4 Metric Connection Relative to an Orthonormal Frame 


In the preceding subsections we saw that on a framed open set U for a vector bundle 
E — M, a connection is completely specified by a matrix [a] of 1-forms on U. 
Suppose now that the vector bundle E — M is endowed with a Riemannian metric. 
The defining property of a metric connection can be translated into a condition on 
the connection matrix [o'] relative to an orthonormal frame. 
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Proposition 11.4. Let E — M be a Riemannian bundle and V a connection on E. 


(i) If the connection V is compatible with the metric, then its connection matrix [co;] 
relative to any orthonormal frame e,,...,¢, for E over a trivializing open set 
U C M is skew-symmetric. 

(ii) If every point p € M has a trivializing neighborhood U for E such that the con- 


nection matrix {@'] relative to an orthonormal frame e,,...,e, for E over U is 


J 
skew-symmetric, then the connection V is compatible with the metric. 


Proof. (i) Suppose V is compatible with the metric. For all X € X(U) and i, j, 


0 => X (e;,e;) => (Vxei,e;) + (e;, Vxe;) 
= (oF (X)ex,e;) + (ci, OF (X)ex) 
— ok (X) xj + ow; (X) dix (6;; = Kronecker delta) 


=o} (X) + @/(X). 


Hence, 


(ii) We note first that compatibility with the metric is a local condition, so V 
is compatible with the metric if and only if its restriction V¥” to any open set U is 
compatible with the metric. Suppose @/ = —o;. Let s = Ya'e; andt = db/e;, with 
a,b! ©C°(U). Then 


=X (Yab') => (Xa')b'+ Vax’. 
Vxs= Vx (ale) = (Xa')e; +a'Vxe; 
= (Xa')e; +a'@7(X)er, 


(Vxs,t) =) (Xa')b' + ¥ ao} (X)b!, (11.7) 
(s, Vxt) =D (Xb!)a' + ¥ bia! (X)al 
=>) (Xb')a' + Yia'b!ai(X). (11.8) 
But by the skew-symmetry of @, 
¥ a'b! a) (X) + Y,a'b/ wi (X) = 


Hence, adding (11.7) and (11.8) gives 


(Vyxs,t) + (s,Vxt) = Yi (Xa')b' +> (Xb')a! 
= X(s,t). 


Proposition 11.5. [f the connection matrix [co'] relative to a frame é),...,€n of 
an affine connection on a manifold is skew-symmetric, then so is the curvature 
matrix [Q']. 
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Proof. By the structural equation (Theorem 11.1), 


a J k 
Q; = do; + @, \@; 


a do; + ( wo) A(—a@j)) (skew-symmetry of [«']) 
=-d wo; — aA Oy (anticommutativity of A) 
= -Qi. 


By Propositions 11.4 and 11.5, if a connection on a Riemannian bundle is com- 
patible with the metric, then its curvature matrix Q relative to an orthonormal frame 
is skew-symmetric. 


11.5 Connections on the Tangent Bundle 


A connection on the tangent bundle TM of a manifold M is simply an affine connec- 
tion on M. In addition to the curvature tensor R(X ,Y)Z, an affine connection has a 
torsion tensor T (X,Y). 

Let U be an open set in M on which the tangent bundle TM has a smooth 


frame ¢1,...,én. If U is a coordinate open set with coordinates x!,...,x”, then 
d/dx',...,0/Ax" is such a frame, but we will consider the more general setting 
where U need not be a coordinate open set. Let 6!,...,0” be the dual frame of 


1-forms on U; this means 
6'(e;) = 6}. 
Proposition 11.6. /fX is a smooth vector field on the open set U, then X =>, 0'(X)e;. 


Proof. Since e1,...,@, is a frame for the tangent bundle TM over U, 


X= Ya! ej 
for some C® functions a/ € C*(U). Applying 0! to both sides gives 


e'(X) = X6'(a’e;) a Ya/5i =a, 


J 


Therefore, 


X=) 6'(X)e;. 
For X,Y € X(U), the torsion T(X,Y) is a linear combination of the vector fields 
€1,---,@n, SO We Can write 


T(X,Y) =). 0'(X, Ye. 


Since T (X,Y) is alternating and $-bilinear, so are the coefficients t'. Therefore, the 
t'’s are 2-forms on U, called the torsion forms of the affine connection V relative to 
the frame e;,...,e, on U. 

Since the torsion and curvature forms are determined completely by the frame 
€1,---,@, and the connection, there should be formulas for tv! and Qi in terms of the 
dual forms and the connection forms. Indeed, Theorem 11.1 expresses the curvature 
forms in terms of the connection forms alone. 
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Theorem 11.7 (Structural equations). Relative to a frame e,,...,€, for the tangent 
bundle over an open set U of a manifold M, the torsion and curvature forms of an 
affine connection on M can be given in terms of the dual 1-forms and the connection 
forms: 

(i) (the first structural equation) t! = do‘ +5, ij OF A@s; 

(ii) (the second structural equation) Qi =d OF + Yi a} A Oo; 


Proof. The second structural equation (ii) is a special case of Theorem 11.1, which 
is true more generally for any vector bundle. The proof of the first structural equation 
(i) is a matter of unraveling the definition of torsion. Let X and Y be smooth vector 
fields on U. By Proposition 11.6, we can write 


Y=) 6/(Y)e;. 


Je; + 0/(Y)Vxe; (Leibniz rule) 


Je; +O4(¥ Jai (X)e; (definition of connection form). 


By symmetry, 
VyX = (YO/(X))e; + 0/(X)@5(Y e;. 


Finally, by Proposition 11.6 again, 
[X,Y] = 6'([X,Y¥])e;. 
Thus, 
T(X,Y) =VxY —VyX — [X,Y] 
= (((xe'”) —Y6'(X) — 6'([X,¥])) + (@i(x)e4(Y) — o}(¥)6!(x))) e; 
= (d0' + oA 0/)(X,Y ei, 


where the last equality follows from (11.3) and (11.2). Hence, 


t=d0'+> ai A!. 


We can now translate the two defining properties of the Riemannian connection 
into conditions on the matrix [@;]. 


Proposition 11.8. Let M be a Riemannian manifold and U an open subset on which 
there is an orthonormal frame é1,...,€n. Let @',...,0" be the dual frame of 1-forms. 
Then there exists a unique skew-symmetric matrix [ai] of 1-forms such that 


d0'+ > ai A@/=0 foralli=1,...,n. (11.9) 
j 
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Proof. In Theorem 6.6 we showed the existence of a Riemannian connection V on 
any manifold. Let [a‘] be the connection matrix of V relative to the orthonormal 
frame e),...,@, on U. Because V is compatible with the metric, by Proposition 11.4, 
the matrix [a5] is skew-symmetric. Because V is torsion-free, by Theorem 11.7, it 
satisfies 

Se =0 


This proves the existence of the matrix [a5] with the two required properties. 

To prove uniqueness, suppose [o'] is any skew-symmetric matrix of 1-forms on 
U satisfying (11.9). In Section 11.2, taking the vector bundle E to be the tangent bun- 
dle TM, we showed that [o'] defines an affine connection V on U of which it is the 
connection matrix relative to the frame e1,...,é,. Because [a5] is skew-symmetric, 
V is compatible with the metric (Proposition 11.4), and because [a5] satisfies the 
equations (11.9), V is torsion-free (Theorem 11.7). Thus, V is the unique Rieman- 
nian connection on U. 


If A = [a ‘| and B = [B; ‘| are matrices of differential forms on M with the number 
of columns of A equal to the number of rows of B, then their wedge product A A B is 
defined to be the matrix of differential forms whose (i, j)-entry is 


(AAB); = DN): 


and dA is defined to be [d a). In matrix notation, we write 


qT! g! 


Then the first structural equation becomes 
tT=d0+Q@A0 
and the second structural equation becomes 


Q=doa+aoana. 


Problems 


11.1. Connection and curvature forms on the Poincaré disk 
The Poincaré disk is the open unit disk 


D={z=x+iyeC| |z| <1} 
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in the complex plane with Riemannian metric 


_ A(dx@dx+dy®dy) — 4(dx®dx+dy@dy) 


(P=[z/?)? Cae? 


(5 )z 


An orthonormal frame for D is 


te) 1 
2 = 2 
a= 5(I-KeP)S, = 5(-PS. 
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Find the connection matrix @ = [o’] and the curvature matrix Q = [Qi] relative to the ort- 
honormal frame e1,e2 of the Riemannian connection V on the Poincaré disk. (Hint: First find 


the dual frame @!, 62. Then solve for wi in (11.9).) 


11.2. Connection defined by a matrix of 1-forms 
Show that (11.4) defines a connection on a vector bundle FE over a framed open set U. 
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In Section 8 we proved Gauss’s Theorema Egregium using vector fields. In this 
section we reprove the theorem, but using differential forms. An essential step is 
the derivation of a differential-form analogue of the Gauss curvature equation. The 
Theorema Egregium gives an intrinsic characterization of the Gaussian curvature of 
a surface, dependent only on the metric and independent of the embedding of the 
surface in a Euclidean space. This characterization can be taken as the definition of 
the Gaussian curvature of an abstract Riemannian 2-manifold. As an example, we 
compute the Gaussian curvature of the Poincaré half-plane. 


12.1 The Gauss Curvature Equation 


The Gauss curvature equation (Theorem 8.1) for an oriented surface M in R? relates 
the curvature tensor to the shape operator. It has an analogue in terms of differential 
forms. 

Consider a smooth surface M in R* and a point p in M. Let U be an open 
neighborhood of p in M on which there is an orthonormal frame e;, e2. This is always 
possible by the Gram—Schmidt process, which turns any frame into an orthonormal 
frame. Let e3 be the cross product e; x e2. Then e;,e2,e3 is an orthonormal frame 
for the vector bundle TR*|y over U. 

For the connection D on the bundle 7R?|,y, let [a5] be the connection matrix of 1- 
forms relative to the orthonormal frame e),e2,e3 over U. Since D is compatible with 
the metric and the frame e), 2, e3 is orthonormal, the matrix [a5] is skew-symmetric 
(Proposition 11.4). Hence, for X € X(M), 


Dxe\ = — @3(X)er — 03(X)es, (12.1) 
Dxex = @3(X)e1 — 03 (X)e3, (12.2) 
Dxe3 = 03 (X)e, + @3(X)er. (12.3) 


Let V be the Riemannian connection on the surface M. Recall that for X,Y € 
X(M), the directional derivative DyY need not be tangent to the surface M, and VxY 
is simply the tangential component (DyY)tan of DyY. By (12.1) and (12.2), 


Vxe1 = (Dyet)tan = —@3(X)er, 


Vrxe2 = (Dxe2)tan = oO; (X)e1. 


It follows that the connection matrix of the Riemannian connection V on M is 


o-| 9 @)_f 1] a 
~ |-@, 0] [-10] 7° 
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the curvature matrix of V is 


- _, _[ 0 do] _ [ 01), 4 
Q=do+ono=do=|_ 4, *|=|- | 204 


So the curvature matrix of V is completely described by 
Q} = do). (12.4) 


Set the unit normal vector field N on U to be N = —e3. By (12.3), the shape 
operator L is described by 


L(X) = —DxN = Dye3 = w4(X)e1 +. @3(X)en, X € X(M). (12.5) 


Theorem 12.1 (Gauss curvature equation). Let e;,¢e2 be an orthonormal frame of 
vector fields on an oriented open subset U of a surface M in R?, and let e3 be a 
unit normal vector field on U. Relative to e),e€2,€3, the curvature form Q3 of the 
Riemannian connection on M is related to the connection forms of the directional 
derivative D on the bundle TR?|y by 


QO} = o! Aa}. (12.6) 


We call formula (12.6) the Gauss curvature equation, because on the left-hand 
side, oF describes the curvature tensor of the surface, while on the right-hand side, 
oO} and w3 describe the shape operator. 


Proof. Let Qi be the curvature forms of the connection D on TR*|y. Because the 
curvature tensor of D is zero, the second structural equation for Q gives 


wi soak 

AO LENO, =), (12.7) 
In particular, 

da; +1 \@; +03 \ a5 +0} \ a3 =0. 
Since o} — wo; = 0, this reduces to 
da; +31 \ a} =0. 
Since the matrix [o'] is skew-symmetric, 
dos = 01 Na}. 


The Gauss curvature equation now follows from (12.4). 
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Keeping the same notations as in the preceding subsection, we will derive formulas 
for the Gaussian curvature of a surface in R?3, first in terms of the connection forms 
for the directional derivative and then in terms of the curvature form oe. 


Proposition 12.2. For a smooth surface in R®, if e,,e2 is an orthonormal frame over 
an oriented open subset U of the surface and e3 is a unit normal vector field on U, 
then the Gaussian curvature K on U is given by 

1 1 
3 (e1) 3 (e2) 


K=det] 2 


3 (e1) ice =~ (4 A @3)(e1,€2). 


Proof. From (12.5), 


Therefore, 


1 
K = detL = det ea 
003 ( 


= 03 (€1) 3 (ex) — @3 (€2) @§ (e1) = (3 A @3)(€1,€2). 


Theorem 12.3 (Theorema Egregium). For a smooth surface in R>, if e1,e2 is an 
orthonormal frame over an open subset U of the surface with dual frame 9! , 07, then 
the Gaussian curvature K on U is given by 


K =Q3(e1,e2) (12.8) 


or by 
do = Ko! Ae. (12.9) 


Proof. Formula (12.8) is an immediate consequence of Proposition 12.2 and the 
Gauss curvature equation (12.6). 
As for (12.9), since 


K = K(0! A 07)(e1,e2) = Q4(e1,e2) 


and a 2-form on U is completely determined by its value on e;,e2, we have Q3 = 
K0! \@7. By (12.4), 


doi = Ko! Ae. 
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Since Oo depends only on the metric and not on the embedding of the surface 
in R3, formula (12.8) shows that the same is true of the Gaussian curvature. We 
can therefore take this formula to be the definition of the Gaussian curvature of an 
abstract Riemannian 2-manifold. It is consistent with the formula in the first version 
(Theorem 8.3) of the Theorema Egregium, for by the definition of the curvature 
matrix, if e;,e2 is an orthonormal frame on an open subset U of M, then 

R(X,Y)ex =Q3(X,Y)e, forall X,Y € X(U). 
so that ‘ ‘ 
(R(e1,€2)e2,€1) = (Qy(e1,e2)e1,e1) = Q3(e1,€2). 


Definition 12.4. | The Gaussian curvature K at a point p of a Riemannian 2- 
manifold M is defined to be 


Kp = (Rp(u,v)v, u) (12.10) 
for any orthonormal basis u, v for the tangent plane 7,M. 


For the Gaussian curvature to be well defined, we need to show that for- 
mula (12.10) is independent of the choice of orthonormal basis. This we do in the 
next section. 


12.3 Skew-Symmetries of the Curvature Tensor 
Recall that the curvature of an affine connection V on a manifold / is defined to be 


R: ¥(M) x X(M) x &(M) > X(M), 
R(X,Y)Z = VxVyZ—VyVxZ— Vix yZ. 


We showed that R(X,Y)Z is ¥-linear in every argument; therefore, it is a point oper- 
ator. A point operator is also called a tensor. It is immediate from the definition that 
the curvature tensor R(X ,Y)Z is skew-symmetric in X and Y. 


Proposition 12.5. [f an affine connection V on a Riemannian manifold M is com- 
patible with the metric, then for vector fields X, Y, Z, W © X(M), the tensor 
(R(X,Y)Z,W) is skew-symmetric in Z and W. 


Proof. Because (R(X ,Y)Z,W) is a tensor, it is enough to check its skew-symmetry 
locally, for example, on an open set U on which there is a frame e1,...,@,. By 
Gram-—Schmidt, we may assume that the frame e,...,@, is orthonormal. Then 


(R(X, Y)e;,e;) = Qi (X,Y). 


Since V is compatible with the metric, its curvature matrix Q = [Q'] relative to an 
orthonormal frame is skew-symmetric. Therefore, 


(R(X, Y )e;,ei) = (X,Y) = -O/ (X,Y) = —(R(X,Y )ej,e;). 
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On U, we can write Z = ¥z'e; and W = ¥ w/e; for some C® functions z', w/ € C”(U). 
Then 


(R(X, Y)Z,W) 2 (X,Y )ei,e;) =— Vi z'w! (R(X, Y )e;,ei) 


We now show that (R(u,v)v,u) is independent of the orthonormal basis u,v for 
T,M. Suppose i, V is another orthonormal basis. Then 


for an orthogonal matrix A = ki ‘l , and 


(R(ii, 0)¥, a) = ((detA)R(u,v)(cu+dv),au+dv) 
= (detA)*(R(u,v)v,u) 


by the skew-symmetry of (R(u,v)z,w) in z and w. Since A € O(2), detA = +1. 
Hence, 
(RG, V)v,@) = (R(u, v)v,u). 


12.4 Sectional Curvature 


Let M be a Riemannian manifold and p a point in M. If P is a2-dimensional subspace 
of the tangent space 7,,M, then we define the sectional curvature of P to be 


K(P) = (R(e1,e2)e2,1) (12.11) 


for any orthonormal basis e;,e2 of P. Just as in the definition of the Gaussian cur- 
vature, the right-hand side of (12.11) is independent of the orthonormal basis e1, e2 
(cf§12.3). 

If u,v is an arbitrary basis for the 2-plane P, then a computation similar to that in 
§8.3 shows that the sectional curvature of P is also given by 


(R(u,v)v, u) 
(u,u)(v,v) — (u,v) ” 


K(P) = 


12.5 Poincaré Half-Plane 


Example 12.6 (The Gaussian curvature of the Poincaré half-plane). The Poincaré 
half-plane is the upper half-plane 


H? = {(x,y) € R? | y > 0} 
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with the metric: 
dx ® dx + dy ® dy 
(, (x) = — yp 


Classically, the notation for a Riemannian metric is ds*. Hence, the metric on the 
Poincaré half-plane is 


a dx ®dx+dy®dy 
y 
With this metric, an orthonormal frame is 


ds 


7] 
LS x? cam: 


So the dual frame is 


Hence, 


1 
d@'=—dxAdy, d0*=0. (12.12) 
y 
On the Poincaré half-plane the connection form O} is a linear combination of dx 
and dy, so we may write 
@ = adx-+ bay. (12.13) 


We will determine the coefficients a and b from the first structural equation: 


do! —-@} Ae’, (12.14) 
de? =-a Ae! =a} ral. (12.15) 


By (12.12), (12.13), and (12.14), 
1 1 1 a 
= dx \dy =d@° = —(adx + bdy) \ — dy = —~ dx dy. 
y y y 
Soa=-—1/y. By (12.12), (12.13), and (12.15), 
ae 1 b 
0 = d0° = (—-dx+bdy) \ —dx = —-—dx Ady. 
y y y 


So b = 0. Therefore, 
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By definition, the Gaussian curvature of the Poincaré half-plane is 


1 
K =Q3(e1,e2) = —plexAdy)ys955) 
Ce) 


Problems 


12.1. The orthogonal group O(2) 


(a) Show that an element A of the orthogonal group O(2) is either 


a ae 
where a2 +b* = 1, a,b ER. 


(b) Let SO(2) = {A € O(2) | detA = 1}. Show that every element A of SO(2) is of the form 


cost —sint 7 
A= , te 


a 


sint cost 


Thus, SO(2) is the group of rotations about the origin in R?. 


; | . Then O(2) = SO(2) USO(2)J. This proves that O(2) has two connected 


components, each diffeomorphic to the circle S!. 


Lets =| 


12.2. Gaussian curvature of the Euclidean plane 

Let e) = 0/Ax,e2 = 0 /dy be the standard orthonormal frame on R?. Compute the connection 
form os relative to e1,@2 of the Riemannian connection on R?2, and then compute the Gaussian 
curvature C. 


12.3. Gaussian curvature of a non-Euclidean plane 
Let h(x) be a C® positive function on R; thus, h(x) > 0 for all x € R. At each (x,y) € R?, 
define 

(, en) = dx @dx+h(x)? dy@ dy. 


(a) Show that (, ) is a Riemannian metric on R?. 
(b) Compute the Gaussian curvature of R? with this metric. 


12.4. Gaussian curvature of the Poincaré disk 
Compute the Gaussian curvature of the Poincaré disk defined in Problem 11.1. 


12.5. Gaussian curvature under a conformal map 
Let T: M — M' be a diffeomorphism of Riemannian manifolds of dimension 2. Suppose at 
each point p € M, there is a positive number a(p) such that 


(Tey, T.W) MT (p) = a(p)v, Ww) M,p 


for all u,v € Tp(M). Find the relation between the Gaussian curvatures of M and M’. 


Chapter 3 


Geodesics 


A geodesic on a Riemannian manifold is the analogue of a line in Euclidean space. 
One can characterize a line in Euclidean space in several equivalent ways, among 
which are the following: 


(1) A line is “straight” in the sense that it has a parametrization with a constant 
velocity vector field. 
(2) A line connecting two points gives the shortest distance between the two points. 


These two properties are not necessarily equivalent on a Riemannian manifold. We 
define a geodesic by generalizing the notion of “straightness.” For this, it is not 
necessary to have a metric, but only a connection. On a Riemannian manifold, of 
course, there is always the unique Riemannian connection, and so one can speak of 
geodesics on a Riemannian manifold. 


$13 More on Affine Connections 


This chapter is a compilation of some properties of an affine connection that will 
prove useful later. First we discuss how an affine connection on a manifold M ind- 
uces a unique covariant derivative of vector fields along a smooth curve in M. This 
generalizes the derivative dV /dt of a vector field V along a smooth curve in R”. 
Secondly, we present a way of describing a connection in local coordinates, using 
the so-called Christoffel symbols. 


13.1 Covariant Differentiation Along a Curve 


Let c: [a,b] + M be a smooth parametrized curve in a manifold M. Recall that a 
vector field along the curve c in M is a function 


V: (a,b) [| Tw, 
| 


tela,b 
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where [| stands for the disjoint union, such that V(t) € T.(r)M (Figure 13.1). Such a 
vector field V(t) is C® if for any C® function f on M, the function V(t)f is C® as a 
function of t. We denote the vector space of all C* vector fields along the curve c(t) 
by I(TM|,(1)). 


Fig. 13.1. Vector field along a curve c(t) in M. 


For a smooth vector field V(t) = ¥v'(t)0/dx' along a smooth curve c(t) in R", 
we defined its derivative dV /dt by differentiating the components v/(t) with respect 
tof. Then dV /dt = >. v'd /dx' satisfies the following properties: 


(i) dV /dt is R-linear in V; 
(ii) for any C* function f on [a,b], 
d(fV) df dV 


Hd ae 


(iii) (Proposition 4.11) if V is induced from a C™ vector field V on R’", in the sense 
that V(t) = Voir) and D is the directional derivative in R”, then 


It turns out that to every connection V on a manifold M one can associate a way 
of differentiating vector fields along a curve satisfying the same properties as the 
derivative above. 


Theorem 13.1. Let M be a manifold with an affine connection V, and c: [a,b] + M 
a smooth curve in M. Then there is a unique map 


D 
51 M(TMlew) FET Mle) 


such that for V €T(TM|.q)), 
(i) (R-linearity) DV /dt is R-linear in V; 
(ii) (Leibniz rule) for any C® function f on the interval {a,b}, 


D(fV) df, ,DV. 
dt aE ae? 
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(iii) (Compatibility with V) if V is induced from a C® vector field V on M, in the 
sense that V(t) =V.), then 


We call DV /dt the covariant derivative (associated to V) of the vector field V 
along the curve c(t) in M. 


Proof. Suppose such a covariant derivative D/dt exists. On a framed open set 
(U,e1,..-,€n), a vector field V(t) along c can be written as a linear combination 


Then 
DV Dns . 
ae tae (v'(t)eze)) (by property (i)) 
dy! ;De; ae 
= a sar (by property (ii)) 
dvi i eg 
=> weit Vine: (by property (iii), (3.0) 


where we abuse notation and write e; instead of e; .(;). This formula proves the 
uniqueness of D/dt if it exists. 

As for existence, we define DV /dt for a curve c(t) in a framed open set U by the 
formula (13.1). It is easily verified that DV /dt satisfies the three properties (i), (ii), 
(iii) (Problem 13.1). Hence, D/dt exists for curves in U. If é1,...,é, is another frame 
on U, then V(t) is a linear combination ¥ 7! (t)é @;-(r) and the covariant derivative 
DV /dt defined by 


DV 
ay Sate, (1) ei 


also satisfies the three properties of the theorem. By the uniqueness of the covariant 
derivative, DV /dt = DV /dt. This proves that the covariant derivative DV /dt is ind- 
ependent of the frame. By covering M with framed open sets, (13.1) then defines a 
covariant derivative DV /dt for the curve c(t) in M. 


Theorem 13.2. Let M be a Riemannian manifold, V an affine connection on M, and 
c: [a,b] + Ma smooth curve in M. If V is compatible with the metric, then for any 
smooth vector fields V,W along c, 


d DV DW 
VW W)+CYV, ; 
at i ( dt ’ ) ( dt ) 
Proof. It suffices to check this equality locally, so let U be an open set on which an 
orthonormal frame e1,...,é, exists. With respect to this frame, 


V= v(t) Jeie(t W= yw! (t) 
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for some C® functions v',w/ on [a,b]. Then 


d 


d bai dv! jdw' 
Pru aero hia =a eo he 


By the defining properties of a covariant derivative, 
on 
=25 rete FP =LS Pe tv Me eis 


where we again abuse notation and write e; instead of e; oc. Similarly, 


DW _ dw! 


IVS sD vtwllen Wawel 
at 2. in Vel (t)Ej)- 
i ij 


Since €1,...,@, are orthonormal vector fields on U and V is compatible with the 
metric, 


(Vernyeirej) + (ei, Vernyei) = ¢' (t)(ei,e;) = c(t) ij = 0. 


Therefore, 


Example 13.3. If V is the directional derivative on R” and V(t) = ¥v'(t)0/dx' is a 
vector field along a smooth curve c(t) in R”, then the covariant derivative is 


dv' a ; _wdvid _ dv 
a laet Lv Dei loa e 


since D.(,0 /dx' = 0 by (4.2). 


13.2 Connection-Preserving Diffeomorphisms 


Although it is in general not possible to push forward a vector field except under a 
diffeomorphism, it is always possible to push forward a vector field along a curve 
under any C” map. Let f: M —> M be aC® map (not necessarily a diffeomorphism) 
of manifolds, and c: [a,b] + M a smooth curve in M. The pushforward of the vector 
field V(t) along c in M is the vector field (f,V)(t) along the image curve f oc in M 
defined by 


(FV) (t) = fac (V(0))- 
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Denote by (M,V) a C® manifold with an affine connection V. We say that a 
C* diffeomorphism f: (M,V) — (M,V) preserves the connection or is connection- 
preserving if for all X,Y € X(M), 


fa(VxY) = Vax fel. 


In this terminology, an isometry of Riemannian manifolds preserves the Riemannian 
connection (Problem 8.2). 

We now show that a connection-preserving diffeomorphism also preserves the 
covariant derivative along a curve. 


Proposition 13.4. Suppose f: (M,V) —> (M,V) is a connection-preserving diffeo- 
morphism, c(t) a smooth curve in M, and D/dt, D/dt the covariant derivatives 
along c in M and f oc in M, respectively. If V(t) is a vector field along c in M, 


then 
p(B) — BUY) 
dt dt — 
Proof. Choose a era, The U of c(t) on which there is a frame ¢,...,@,, and 
write V(t) = Yvi(t Je: cr). Then @ := f.e1,...,€n ‘= fren is a frame on the neigh- 


borhood f(U) of f(c (t yy in M and 


=Y v(t) i,(foc)( 


By the definition of the covariant derivative, 


DV dy! j 
at ra )= > at (t Je €ic(t) TV (t) Vert) ei- 


Because f preserves the connection, 
DV dv' bak 
(¥. ) (t) = y ae) (pew) +V (t)V pcr) fui 


= ap Fil roete) FY OY (foey(n ei 


13.3 Christoffel Symbols 


One way to describe a connection locally is by the connection forms relative to a 
frame. Another way, which we now discuss, is by a set of n> functions called the 
Christoffel symbols. 
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The Christoffel symbols are defined relative to a 
coordinate frame. Let V be an affine connection on ~~ 
a manifold M and let (U,x!,...,x") be a coordinate 
open set in M. Denote by 0; the coordinate vector >. “hy 
field 0/dx'. Then V9,0; is a linear combination of ) ~ 
O|,.--,0, SO there exist numbers ri, at each point 
such that 
These n° functions ry are called the Christoffel sym- 
bols of the connection V on the coordinate open set 
(U,x!,...,x"). By the Leibniz rule and ¥-linearity Blin Bane Christoftel 
in the first argument of a connection, the Christoffel 
symbols completely describe a connection on U. (1829-1900) 


n 
Va; = > Tho. 
k=1 


Proposition 13.5. An affine connection V on a manifold is torsion-free if and only if 
in every coordinate chart (U,x',...,x") the Christoffel symbol ae is symmetric in i 
and j: 


Proof. (=>) Let (U,x!,...,x") be a coordinate open set. Since partial differentiation 
is independent of the order of differentiation, 
ad oO 
lai, 0) = 5 = 


By torsion-freeness, 
V4,9; — Va, = [0,0j] =0. 


In terms of Christoffel symbols, 
YF % — YTV 5A = 0. 
k k 


Since 0j,...,0, are linearly independent at each point, ae = I 


(<) Conversely, suppose Ty = rm, in the coordinate chart (U,x!,...,x”). Then 
V3,0; = Va,0i. Hence, 


T (0;,0;) = Va,0j — Va,0i = 0. 


Since T(, ) is a bilinear function on 7,M, this proves that T(X,Y)p:=T(Xp,Yp) =0 
for all X,,Y¥, € T,M. Thus, for all X,Y € X(M), we have T(X,Y) =0. 


Remark 13.6. Because of this proposition, a torsion-free connection is also called a 
symmetric connection. 
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Example 13.7 (Christoffel symbols for the Poincaré half-plane). The Poincaré half- 
plane (Example 12.6) is covered by a single coordinate open set (U,x,y). Let 0) = 
0/Ax, 02 = 0/dy be the coordinate frame. We showed in Example 12.6 that its 
connection form os} relative to the orthonormal frame e; = yO), €2 = yoo is 


1 
ao} = —-dx. 
y 
Then 
1 
oO} = -e}, 02 = -e2, 
y y 


and for any smooth vector field X on the Poincaré half-plane, 


1 1 1 
VxO1 = Vx (5) =X (5) ey+ —Vyxe] 
y y J 


1 1 
= ~~ (Xy)yAi + — a7 (X)eo 
y y 
1 1 
= a = pon es 
1 1 
= ——(Xy)d; + —(Xx)a2 (13.2) 
y y 
Similarly, 


1 1 
Vx02 = re ca _ je (13.3) 


By (13.2) and (13.3), relative to the coordinate frame 0;,02, the connection is 
given by 


1 1 
Va,91 = —0>, Va,%1 = —-dO1, 

y y 

1 1 
V9,0.=—-9O1, V9,02 = —-0. 

y y 


Therefore, the Christoffel symbols Ty for the Poincaré half-plane are 


Sok 1 > 
ij 

11 0 I/y 
12 -1/y 0 
pa -1/y 0 
22 0 -I/y 
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Problems 


13.1. Covariant derivative along a curve 
Verify that if DV /dt is defined by (13.1), then it satisfies the three properties (i), (ii), and (iii) 
of Theorem 13.1. 


13.2. Covariant derivative on a surface in R* 
Let M be a surface in R? with its Riemannian connection: for X,Y € X(M), 


VxY = pr(DxY). 


If V(t) is a vector field along a curve c(t) in M, show that DV /dt = pr(dV /dt). (Hint: Show 
that pr(dV /dt) verifies the three properties of the covariant derivative.) 


13.3. Chain rule for the covariant derivative 
Suppose M is a manifold with a connection and DV /dt is the covariant derivative of a vector 
field V along a curve c(t) in M. If ¢ is a C® function of another variable u, then V gives rise 
to a vector field V(t(w)) along the curve ¢(u) = c(t(u)). We will write V(u) = V (t(u)). Show 
that Z 

DV _ DV dt 

du dt du’ 


13.4. Christoffel symbols of a surface of revolution 
Let M be a surface of revolution in R? with parametrization 


f(u)cosv 
w(u,v) = | f(u)sinv|, O<v<2a 
gu) 
as in Problem 5.7. Let V be the Riemannian connection on M. Find the Christoffel symbols 
of V with respect to u,v. 


13.5. Christoffel symbols of the Poincaré disk 
Compute the Christoffel symbols of the Riemannian connection on the Poincaré disk D in 
Problem 11.1. 
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814 Geodesics 


In this section we prove the existence and uniqueness of a geodesic with a specified 
initial point and a specified initial velocity. As an example, we determine all the 
geodesics on the Poincaré half-plane. These geodesics played a pivotal role in the 
history of non-Euclidean geometry, for they proved for the first time that the parallel 
postulate is independent of Euclid’s first four postulates. 


14.1 The Definition of a Geodesic 
A straight line in R” with parametrization 
c(t)=pt+ty, p,veR’, 


is characterized by the property that its acceleration c’(t) is identically zero. If 
T(t) =c'(t) is the tangent vector of the curve at c(t), then c(t) is also the covariant 
derivative DT /dt associated to the Euclidean connection D on R”. This example 
suggests a way to generalize the notion of “straightness” to an arbitrary manifold 
with a connection. 


Definition 14.1. Let M be a manifold with an affine connection V and / an open, 
closed, or half-open interval in R. A parametrized curve c: I > M is a geodesic if 
the covariant derivative DT /dt of its velocity vector field T(t) = c’(t) is zero. The 
geodesic is said to be maximal if its domain J cannot be extended to a larger interval. 


Remark 14.2. The notion of a geodesic depends only on a connection and does not 
require a metric on the manifold M@. However, if M is a Riemannian manifold, then 
we will always take the connection to be the unique Riemannian connection. On a 
Riemannian manifold, the speed of a curve c(t) is defined to be the magnitude of its 
velocity vector: 


lle") = V(e'(),"(@))- 
Proposition 14.3. The speed of a geodesic on a Riemannian manifold is constant. 


Proof. Let T = c'(t) be the velocity of the geodesic. The speed is constant if and 
only if its square f(t) = (T,T) is constant. But 


So f(t) is constant. 
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Let M be a smooth surface in R*. Recall that its Riemannian connection is 
given by 
Vx¥ = (DxY )tan, 
the tangential component of the directional derivative DyY. If N is a unit normal 


vector field on an open subset U of M, then the tangential component of a vector 
field Z along M is 


Zan = Z—(Z,NYN. 


By Problem 13.2, for a vector field V along a curve c(t) in M, the covariant derivative 
associated to the Riemannian connection V on M is 


bv _(av 
de Nae ye 


Example 14.4 (Geodesics on a sphere). On a 2-sphere M of radius a in R’, let y(t) 
be a great circle parametrized by arc length. Then y(t) has unit speed. Differentiating 


(7(@),¥@) =1 


with respect to ¢ gives 
2(7"(t), ¥ (t)) =0. 


This shows that the acceleration y’(t) of a unit-speed curve in R? is perpendicular to 
the velocity. 

Since y(t) lies in the plane of the circle, so do y(t) and y(t). Being perpendic- 
ular to y(t), the acceleration y’(t) must point in the radial direction (Figure 14.1). 
Hence, because y(t) is perpendicular to the tangent plane at y(r), 


eee: (=) =¥" (tan =0. 


dt 


Fig. 14.1. Velocity and acceleration vectors of a great circle. 
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14.2 Reparametrization of a Geodesic 
In this section we show that a geodesic can be reparametrized as long as the 
reparametrization is a linear function. 


Proposition 14.5. Suppose y(u) is a nonconstant geodesic on a manifold with a con- 
nection and Y(t) := y(u(t)) is a reparametrization of y(u). Then Y(t) is a geodesic if 
and only if u = at + B for some real constants o and B. 

Proof. Let T(u) = y(u) and T(t) = 7 (t) be the tangent vector fields of the two 
curves y(u) and 7(t). By the chain rule, 


By property (ii) of a covariant derivative, 


DT ow ! DT (u(t)) 
DE =u u(t) +ul QPL) 
=u" (t)T(u(t)) + v()>- a (by the chain rule, Problem 13.3) 
u 


=u"(t)T (u(t), 
where DT /du = 0 because y(u) is a geodesic. Since T(u) has constant length, it is 
never zero. Therefore, 
DT , 
dt : 
for some a, B ER. 


0 u=at+B 


Corollary 14.6. Let (a,b) be an interval containing 0. For any positive constant 
k ER‘, the curve y(u) is a geodesic on (a,b) with initial point q and initial vector v 
if and only if ¥(t) := y(kt) is a geodesic on (a/k,b/k) with initial point q and initial 
vector ky. 


Proof. Since 

a b 

SESE Se ax<kt<b, 
the curve 7(t) := y(kt) is defined on (a/k,b/k) if and only if y(t) is defined on (a, D). 
By the chain rule, 


7 (t) =ky (kt). (14.1) 


Equivalently, 
T(t) =kT (kt), 
so that 7 
DT DT 
——(t) =k’ (ia). 
dt (t) dt “ 


Thus, 7 is a geodesic if and only if y is a geodesic. They have the same initial point 
7(0) = (0). By (14.1), their initial vectors are related by 


7 (0) =k/ (0). 
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14.3 Existence of Geodesics 


Suppose M is a manifold with a connection V, and (U,@) = (U,x',...,x”) is a chart 
relative to which the Christoffel symbols are Ty. Let c: [a,b] > M be a smooth 
curve. In the chart (U,@) the curve c has coordinates 


(@ oc)(t) = (a'(€e(t)),..-,2"(e(F))). 
We will write y(t) = (@ o c)(t) and y(t) = (x! o c)(t), and say that c is given in U by 
y(t) ="), (0). 


In this section we will determine a set of differential equations on y'(r) for the curve 
to be a geodesic. 
Denote by »* the first derivative dy‘ /dt and by ¥ the second derivative dy’ /dt*. 
Let 0; = 0/dx'. Then 
= YH (t) 


and 


= 59}, ok LH Ve 9 
J J 

=>, HOpea)+ > WVyi area) OF" (14.2) 
j ij 


To simplify the notation, we sometimes write 0; to mean Oj c(t) and yj to mean 
T¥,(c(t)). Then (14.2) becomes 


J = LI} + LH V9 8) 
J ij 
= + > WTF oR 
k 


i,j,k 


= (x +2 WTk, 1) O. 


k 


So c(t) is a geodesic if and only if 


P+ RTH = = k=1,...,n. 


We summarize this discussion in the following theorem. 


Theorem 14.7. On a manifold with a connection, a parametrized curve c(t) is a 
geodesic if and only if relative to any chart (U,@) = (U,x!,...,x"), the components 
of (@ 0 c)(t) = (y!(t),...,y"(t)) satisfy the system of differential equations 


j RTH = = k=1,...,n, (14.3) 


where the Ty ’s are evaluated on c(t). 
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This is a system of second-order ordinary differential equations on the real line, 
called the geodesic equations. By an existence and uniqueness theorem of ordinary 
differential equations, we have the following theorem. 


Theorem 14.8. Let M be a manifold with a connection V. Given any point p © M 
and tangent vector v € TpM, there is a geodesic c(t) in M with initial point c(0) = p 
and initial velocity c'(0) =v. Moreover, this geodesic is unique in the sense that 
any other geodesic satisfying the same initial conditions must agree with c(t) on the 
intersection of their domains. 


Example 14.9 (Geodesics on a sphere). Suppose y: [a,b] > S* is a geodesic. For 
to € [a,b], there is a great circle through y(fo) with velocity y/(to). From Exam- 
ple 14.4, we know that this great circle is a geodesic. Since a geodesic with a given 
point and a given velocity vector at the point is unique (Theorem 14.8), y(t) must co- 
incide with a great circle wherever it is defined. This proves the converse of Example 
14.4 that all geodesics on S* lie on great circles. 


Let ¥,(t,q) be the unique maximal geodesic with initial point q and initial vector 
v € T,M. We also write y,(t). By Corollary 14.6, 


Ww (kt) = Yev(t) 


for any positive real number k. 

On a Riemannian manifold we always use the unique Riemannian connection to 
define geodesics. In this case, tangent vectors have lengths and the theory of ordinary 
differential equations gives the following theorem. 


Theorem 14.10. For any point p of a Riemannian manifold M, there are a neighbor- 
hood U of p and numbers 6,a > 0 so that for any q € U and v € T,M with |\v|| < 6, 
there is a unique geodesic y: (—a,a) + M with y(0) = q and y' (0) =v. 


A priori the interval of definition (—a,a) of the geodesic in this theorem may be 
quite small. However, by rescaling the interval by a constant factor (Corollary 14.6), 
one can expand the interval to any finite length, at the expense of making the initial 
vector shorter. 


Theorem 14.11. For any point p of a Riemannian manifold M, there are a neigh- 
borhood U of p and a real number € > 0 so that for any q € U and v € TjM with 
\|v|| < €, there is a unique geodesic 7: (—2,2) + M with 7(0) = q and ¥ (0) =. 


Proof. Fix p € M and find a neighborhood U of p and positive numbers 6 and a as 
in Theorem 14.10. Set k=a/2. By Corollary 14.6, y(t) is a geodesic on (—a,a) 
with initial point q and initial vector v if and only if 7(t) := y(kt) is a geodesic on 
(—a/k,a/k) = (—2,2) with initial point g and initial vector 9 := kv. Moreover, 

ad 


Ml =NZON <6 = [lr = [17% Ol = ky] < k= >. 


Choose € = a6 /2. Then Theorem 14.10 translates into this one. 
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Proposition 14.12. A connection-preserving diffeomorphism f: (M,V) — (M,V) 
takes geodesics to geodesics. 


Proof. Suppose c(t) is a geodesic in M, with tangent vector field T(t) = c(t). If 
T(t) = (f oc)'(t) is the tangent vector field of f oc, then 


F(t) =f. (<.5) =A('@)=AT. 


Let D/dt and D/dt be the covariant derivatives along c(t) and (f 0 c)(t), respectively. 
Then DT /dt = 0, because c(t) is a geodesic. By Proposition 13.4, 


DT DT 
“a (=) =a 


Hence, f oc is a geodesic in M. 


Because an isometry of Riemannian manifolds preserves the Riemannian con- 
nection (Problem 8.2), by Proposition 14.12 it carries geodesics to geodesics. 


14.4 Geodesics in the Poincaré Half-Plane 


The Poincaré half-plane is covered by a single coordinate open set with coordinates 
x,y (Figure 14.2). In Example 13.7 we calculated its Christoffel symbols. The 


UY 


Fig. 14.2. The Poincaré half-plane. 


system of geodesic equations (14.3) for the geodesic c(t) = (x(t),y(t)) consists of 
two equations: 


2 
X¥— —xy = 0, (14.4) 
y 
1 1 
J+ =f SO, (14.5) 
y y 
There is a third equation arising from the fact that a geodesic has constant speed 


(Proposition 14.3): 
(c(t), é(t)) = constant. (14.6) 
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This equation is a consequence of the first two, because any curve satisfying Equa- 
tions (14.4) and (14.5) will be a geodesic and hence will have constant speed. We 
include it because of its simplicity. In fact, by a simple reparametrization, we may 
even assume that the speed is 1. Hence, (14.6) becomes 


aay 
seh 
y 
or 
P+ypay’, (14.7) 


Assuming x not identically zero, we can divide (14.4) by x and solve it by sepa- 
ration of variables: 


Integrating both sides with respect to ¢ gives 
Inx = 2(Iny) + constant = In(y”) + constant. 


Hence, 
x= ky’ (14.8) 


for some constant k 
Plugging this into (14.7) and solving for y gives 


Pyt+y ay’, (14.9) 
y= tyV1—k*y?. (14.10) 
Dividing (14.8) by (14.10), we get 


dx ky? _ ky 
dy“ y/1-Ry ~ V/1- Ry? 


Integrate with respect to y to get 


1 
eV 1 —k?y? + constant d. 


This equation can be put in the form 


(dP ty =o, 
which is the equation of a circle centered on the x-axis (Figure 14.3). 

In our derivation so far, we assumed that the first derivative x is not identically 
zero. It remains to consider the case x = 0. If x = 0, then x = constant, so the curve 
is a vertical line. Thus, the geodesics in the Poincaré half-plane are either vertical 
lines or semicircles centered on the x-axis. 
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kK— + I 
Fig. 14.3. A geodesic in the Poincaré half-plane. 


given point 


| <— given line 


Fig. 14.4. Geodesics in the Poincaré half-plane. 


In this calculation we never used (14.5). It is in fact a redundant equation 
(Problem 14.1). 

The Poincaré half-plane is important historically, since it provides a model of 
a non-Euclidean geometry. In Euclid’s axiomatic development of plane geometry, 
the first four postulates are usually considered self-evident [10, pp. 14-18]. The 
fifth postulate, called the parallel postulate, was a source of controversy. Two lines 
in the plane are said to be parallel if they do not intersect. One form of the fifth 
postulate states that given a line and a point not on the line, there is a unique parallel 
line through the given point. For hundreds of years, heroic efforts were made to 
deduce the fifth postulate from the other four, to derive a contradiction, or to prove 
its independence. If we interpret the geodesics of the Poincaré half-plane as lines, 
then the Poincaré half-plane satisfies Euclid’s first four postulates, but not the parallel 
postulate: given a geodesic, say a vertical line, and a point not on the line, there are 
infinitely many geodesics through the point and not intersecting the given line (see 
Figure 14.4). In this way the Poincaré half-plane proves conclusively that the parallel 
postulate is independent of Euclid’s first four postulates. 


14.5 Parallel Translation 


Closely related to geodesics is the notion of parallel translation along a curve. A par- 
allel vector field along a curve is an analogue of a constant vector field in R”. 
Throughout the rest of this chapter we assume that M is a manifold with a connection 
V and c: J M is a smooth curve in M defined on some interval /. 


Definition 14.13. A smooth vector field V(t) along c is parallel if DV /dt = 0 on I. 


In this terminology a geodesic is simply a curve c whose tangent vector field 
T(t) =c'(t) is parallel along c. 
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Fix a point p = c(t) and a frame e1,...,é, in a neighborhood U of p in M. 
Let [o'] be the matrix of connection forms of V relative to this frame. If V(r) = 
Lv (tej, (2) for c(t) € U, then 


DV Pe Pei, 

a 2? Coty 2 a W) 

= > Ve: ct ga pS WV (t)@j 
i J 

— pe Vee) bn ps vl oi (c (1) )eic(r) 
i uj 

=¥ (HF + Loi) ere: 
i J 


Thus DV /dt = 0 if and only if 


Y+>¥ ai(c())/ =0, i=1,...,n. (14.11) 


This is a system of linear first-order ordinary differential equations. By the existence 
and uniqueness theorem of ordinary differential equations, there is always a unique 
solution V(t) on a small interval about to with a given V(to). In the next subsection 
we show that in fact the solution exists not only over a small interval, but also over 
the entire curve c. 

If V(t) is parallel along a curve c: [a,b] > M, then we say that V(b) is obtained 
from V(a) by parallel translation along c or that V(b) is the parallel translate or 
parallel transport of V(a) along c. By the uniqueness theorem of ordinary differen- 
tial equations, V(t) is uniquely determined by the initial condition V (a), so if parallel 
translation exists along c, then it is well defined. 


14.6 Existence of Parallel Translation Along a Curve 


While a geodesic is guaranteed to exist only locally, parallel translation is possible 
along the entire length of a curve. 


Theorem 14.14. Let M be a manifold with a connection V and let c: |a,b] + M be 
a smooth curve in M. Parallel translation is possible from c(a) to c(b) along c, i.e., 
given a vector vo € T.(q), there exists a parallel vector field V(t) along c: |a,b] + 
M such that V(a) = vo. Parallel translation along c induces a linear isomorphism 


a,b: Te(a)(M) = Tp) (M). 


Proof. Because the parallel transport equation DV /dt = 0 is R-linear in V, a lin- 
ear combination with constant coefficients of parallel vector fields along c is again 
parallel along c. 

Let w1,...,W» be a basis for T.,,)M. For each i = 1,...,n, there is an interval 
inside [a,b] of length €; about fo such that a parallel vector field W;(r) exists along 
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c: (to — &,to + &) — M whose value at fo is w;. For € equal to the minimum of 
€1,---,€n, the n basis vectors w1,...,W, for Te(iy)M can be parallel translated along 
c over the interval (f — €,49 +€). By taking linear combinations as in the remark 
above, we can parallel translate every tangent vector in T.(,.)M along c over the 
interval (to — €,t0 + €). 

For t; € (to — €,f9 + €) parallel translation along c produces a linear map Qj, 1, : 
Te(iy)M — T.,)M. Note that a vector field V(r) is parallel along a curve c(r) if and 
only if V(—t) is parallel along c(—r), the curve c reparametrized with the opposite 
orientation. This shows that the linear map @,);, has an inverse @;, ;. and is therefore 
an isomorphism. 

Thus for each t € [a,b] there is an open interval about t over which parallel trans- 
lation along c is possible. Since [a,b] is compact, it is covered by finitely many such 
open intervals. Hence, it is possible to parallel translate along c from c(a) to c(b). 


While a geodesic with a given initial point and initial velocity exists only locally, 
parallel translation is always possible along the entire length of a smooth curve. In 
fact, the curve need not even be smooth. 


Definition 14.15. A curve c: [a,b] > M is piecewise smooth if there exist numbers 
a=to<t)<::-<t-=b 
such that c is smooth on [t;,t;41] fori =0,...,r—1. 


By parallel translating over each smooth segment in succession, one can parallel 
translate over any piecewise smooth curve. 


14.7 Parallel Translation on a Riemannian Manifold 


Parallel translation is defined on any manifold with a connection; it is not necessary 
to have a Riemannian metric. On a Riemannian manifold we will always assume that 
parallel translation is with respect to the Riemannian connection. 


Proposition 14.16. On a Riemannian manifold M parallel translation preserves 
length and inner product: if V(t) and W(t) are parallel vector fields along a smooth 
curve c: [a,b] > M, then the length ||V(t)|| and the inner product (V(t),W(t)) are 
constant for all t € [a,b]. 


Proof. Since ||V(t)|| = /(V(t),V(¢)), it suffices to prove that (V(t),W(t)) is con- 
stant. By the product rule for the covariant derivative of a connection compatible 
with the metric (Theorem 13.2), 


d DV DW 
5 vw) =( a w) (¥, = )=0 


since DV /dt = 0 and DW /dt = 0. Thus (V(t),W(t)) is constant as a function of t. 
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Since the angle 0 between two vectors u and v can be defined in terms of length 
and inner product by 


cos@ = 


, O<OK<T, 


parallel translation also preserves angles. 


Fig. 14.5. Parallel translating v along a closed piecewise smooth geodesic. 


Example 14.17 (Parallel translation on a sphere). Let y, and y be two meridians 
through the north pole p of a sphere S* making an angle @ with each other, and let 
v be a vector at p tangent to y, (Figure 14.5). As great circles, both yj; and 7 are 
geodesics on the sphere. We will parallel translate v along y; to the equator, along 
the equator to ~», and then along }% back to the north pole p. 

Since y; is a geodesic, its tangent vector field is parallel, so the parallel translate 
of v to the equator remains parallel to yj, and perpendicular to the equator. Since 
parallel translation preserves angles, as v is parallel translated along the equator to 
%, it remains perpendicular to the equator at all times. When it reaches ~% along the 
equator, it will be tangent to 72. Parallel translating this vector along y back to p 
results in a tangent vector to ¥ at p. Figure 14.5 shows the position of v at various 
points of its journey. From this example we see that parallel translating a tangent 
vector of a manifold around a closed loop need not end in the original vector. This 
phenomenon is called holonomy. 


Problems 


14.1* Geodesic equations 
Show that if we differentiate (14.10) with respect to t, we end up with (14.5). Hence, (14.5) is 
redundant. 


14.2. Surface of revolution in R? 


In Problem 5.7 you compute the mean and Gaussian curvatures of a surface of revolution M 
+ 13 
in R’. 
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(a) Using the notations of Problem 5.7, find the geodesic equations of the surface of revolution. 

(b) Prove that the meridians of the surface of revolution are geodesics. 

(c) Find a necessary and sufficient condition on f(u) and g(u) for a latitude circle to be a 
geodesic. 


14.3. Poincaré disk 
Let D be the Poincaré disk (Problem 11.1). 


(a) Show that in polar coordinates (r, 0), the Poincaré metric is given by 


A(dr@dr+r?d0@dé) 
( ’ (7,8) = ( —P 


Using polar coordinates, compute for D 


(b) the Gaussian curvature, 
(c) the Christoffel symbols, 
(d) the geodesic equations. 
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$15 Exponential Maps 


There are two exponential maps in geometry: the exponential map of a connection 
and the exponential map for a Lie group. While these are two independent notions, 
when a Lie group has a bi-invariant Riemannian metric, as all compact Lie groups 
do, the exponential map for the Lie group coincides with the exponential map of the 
Riemannian connection. 

On a manifold M with a connection, a geodesic is locally defined by a system 
of second-order ordinary differential equations. By the existence and uniqueness 
theorems of ordinary differential equations there is a unique geodesic through any 
given point q with any given direction v. However, this geodesic may be very short, 
defined only on a small interval (—a,a) about 0. By reparametrizing the geodesic 
by a constant factor, one can expand the domain of definition of the geodesic at the 
expense of shortening the initial vector. 

On a Riemannian manifold every point p has a neighborhood U in which there 
is a uniform bound such that all geodesics starting in U with initial vectors of length 
less than the bound will be defined on an interval including [—1, 1] (Theorem 14.11). 
For any q € U this leads to the definition of the exponential map Exp, from a small 
neighborhood of the origin in 7,M to the manifold M. This exponential map derives 
its importance from, among other things, providing coordinate charts in which any 
isometry is represented by a linear map (see Section 15.2). 

The exponential map for a Lie group G is defined in terms of the integral curves 
of the left-invariant vector fields on G. Unlike the exponential map of a connection, 
the Lie group exponential is defined on the entire Lie algebra g. It shares some of 
the properties of the exponential map of a connection. The problems at the end of 
the chapter explore the relationship between the two notions of exponential map. 


15.1 The Exponential Map of a Connection 


Suppose M is a manifold with an affine connection. For any point g € M and vector 
v € T,M, denote by y,(t,qg) or simply y,(t) the unique maximal geodesic with initial 
point y,(0) = g and initial vector 7/(0) = v. If y,(1) is defined, we set 


Exp,(v) = w(1). 


Now assume that M is endowed with a Riemannian metric. For any point p € M, 
Theorem 14.11 guarantees a neighborhood U of p and a real number € > 0 such that 
for all q € U the exponential map Exp, is defined on the open ball B (0,€) in TM: 


Exp,: T;M > B(0,€) > M. 


Let s: M > TM be the zero section of the tangent bundle TM, and let s(U) be the 
image of U C M under s. Viewed as a function of two variables (gq, v), the exponential 
map Exp,(v) is defined on an €-tube around s(U) in TM (see Figure 15.1). 
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k— U ——— 


Fig. 15.1. A neighborhood U of p and the domain of the exponential map in the tangent bundle 
TM. 


Proposition 15.1. On a manifold with a connection the maximal geodesic with initial 
point q and initial vector v is ¥,(t,q) = Exp,(ty). 
Proof. 


Exp,(tv) = Y(1) (definition of Exp,) 
=(t) (Corollary 14.6). 


Theorem 15.2 (Naturality of the exponential map). Let f: N — M be an isometry 
of Riemannian manifolds, and p € N. Suppose V C TN and U C Ty:p)M are neigh- 
borhoods of the origin on which the exponential maps Exp, and Exp (p) are defined. 
If the differential f, ) maps V into U, then the following diagram is commutative: 


yy 
Ex, | EXp/(p) 


Proof. For v € V C TpN, let ¥(t,p) be the maximal geodesic in N with initial 
point p and initial vector v. Since an isometry takes geodesics to geodesics, 
(f o %)(t, p) is the maximal geodesic in M with initial point f(p) and initial vec- 
tor f,v. By the uniqueness of the geodesic, 


(fo W(t, P) = Ypv(t, (Pp). 


Setting t = | gives 


F(Exp, v) = Exp fp) (fev). 


15.2 The Differential of the Exponential Map 117 


15.2 The Differential of the Exponential Map 


Suppose p is a point in a Riemannian manifold N and V C 7,N is a neighborhood 
of 0 on which the exponential map Exp, is defined. Since T,N is a vector space, its 
tangent space 7p(T;,N) at the origin can be canonically identified with T,N. 


Proposition 15.3. The differential (Exp,)x,0 at the origin of the exponential map 
Exp,: V C T,N — N is the identity map 17,n : TpN — T)N. 


Proof. For v € T,N, a curve c(t) in T,N with c(0) = 0 and c’(0) = v is c(t) = tv. 
Using this curve to compute the differential of Exp,,, we have 


d d 
(Exp, )«,0(v) =a, gerne (c(t) =a, Ee 
= y(t) (Proposition 15.1) 
dt |,—0 
_ (0) a) 


where ¥,(t) is the unique geodesic with initial point p and initial vector v. 


By the inverse function theorem, the map Exp,,: V C T,N — N is a local diffeo- 
morphism at 0 € T,N. Thus, there are neighborhoods V’ of 0 in T,N and V” of p in 
N such that Exp,: V’ + V" is a diffeomorphism. 

Now suppose f: N — M is an isometry of Riemannian manifolds. As above, 
there are neighborhoods U’ of 0 in T;;,)M and U" of f(p) in M such that Exp f(,) : U' 
— U" isa diffeomorphism. Since f,,, is continuous, one may choose V’ sufficiently 
small so that f., maps V’ to U’. By the naturality of the exponential map (Theorem 
15.2) there is a commutative diagram 


fap 
TN > FF U' Cc Typ)M 
Exp, | = = | EXP /(p) 
N Dd yp" U" Cc M. 


This diagram may be interpreted as follows: relative to the coordinate charts 
given by the inverse of the exponential map, an isometry of Riemannian manifolds 
is locally a linear map; more precisely, it is the linear map given by its differential. 
This is in contrast to the usual statement about the differential f, , at a point p being 
the best linear approximation to a C® map in a neighborhood of the point. Here the 
isometry f is equal to its differential right on the nose. 
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15.3 Normal Coordinates 


Fix a point p in a Riemannian manifold M. By the preceding section, there is a 
neighborhood V of 0 in 7,M and a neighborhood U of p in M such that the expo- 
nential map Exp,: V — U is a diffeomorphism. Using the exponential map we can 
transfer coordinates on T,M to M. Choose an orthonormal basis ¢1,...,@, for T,M 
and let r!,...,r” be the coordinates with respect to the orthonormal basis e1,...,€p 
on 7,M. Then xii=r!o Exe yosae =r" o Exp," is a coordinate system on U 
such that the tangent vectors 0/dx!,...,0/0x" are orthonormal at p. The coordinate 
neighborhood (U,x!,...,x") is called a normal neighborhood of p and x',...,x" are 
called normal coordinates on U. 

In a normal neighborhood of p, the geodesics through p have a particularly sim- 
ple expression, for the coordinate expression for the geodesic y(t) = Exp,(at) for 
a= ae € T,M is 

x(y(t)) =re Exp, (y(t)) = at. 


We write this as (x!,...,x”) = (a't,...,a"t). 


Theorem 15.4. In a normal neighborhood (U,x',...,x") of p, all the partial deriva- 
tives of gi; and all the Christoffel symbols Ty, vanish at p. 


Proof. Let (x!,...,x") = (a!t,...,a"t) be a geodesic through p. It satisfies the 
geodesic equations 


a kosiej 
tye 0, FS Tyucn 
ij 
or 
on omy oe 
DLV iyja‘'a! =0. 


Since this is true for all (a!,...,a") at p, setting (a!,...,a") =(0,...,1,0,...,0,1,...,0) 
with a! = a/ = | and all other entries 0, we get 


k Ko 
rk +Tk =0. 


By the symmetry of the connection, ae = 0 at the point p. (At other points, not all 
of (x!,...,x”) = (a!t,...,a"t) will be geodesics.) 
Write 0; for 0/dx'. By the compatibility of the connection V with the metric, 


K8ij = %(0;,0;) = (Va,0;,0;) + (0;, Va,9;)- 


At p, Vg,0; = 0 and V9,0; = 0 since all the Christoffel symbols vanish. Therefore, 
(%gij)(P) = 9. 


Normal coordinates are especially useful for computation, because at the point 
p, all Vg, 0; = 0. 
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15.4 Left-Invariant Vector Fields on a Lie Group 


Let G be a Lie group and g = 7,G its Lie algebra. Any element X, € g determines 
a left-invariant vector field X on G by Xz := (€,).(Xe). This establishes a one-to- 
one correspondence between 7.G and the vector space of left-invariant vector fields 
on G. Sometimes we adopt an alternate notation for a left-invariant vector field on a 
Lie group: if A € g, then the left-invariant vector field on G generated by A is A, with 
Ag = ly,A. 

Proposition 15.5. [f @,(p) is an integral curve starting at p of a left-invariant vector 
field X on a Lie group G, then for any g € G, the left translate g@,(p) is the integral 
curve of X starting at gp. 


Proof. We compute the velocity of g@,(p): 


d 
7 (g@:(p)) = (le) ane (p) (definition of d/dt and the chain rule) 
= (Lg) xX, (p) (@;(p) is an integral curve of X) 
= Xgo,(p) (X is left-invariant). 


This proves that g@,(p) is also an integral curve of X. At t=0, gQ,(p) = 
890(P) = gp. 


Corollary 15.6. The local flow ©, of a left-invariant vector field X on a Lie group 
G commutes with left multiplication: 0 @; = Q; ° €g for all g € G, whenever both 
sides are defined. 


Proof. By Proposition 15.5, both g@,(p) and @,(gp) are integral curves of X starting 
at gp. By the uniqueness of the integral curve, g@,(p) = @(gp). This can be rewrit- 


ten as (Le 0 @)(P) = (Gro Le) (P). 


Proposition 15.7. The maximal integral curve ©,(p) of a left-invariant vector field 
X ona Lie group G, where p is a point of G, is defined for allt € R. 


Proof. By an existence theorem of ordinary differential equations, there are a real 
number € > 0 and an integral curve @,(e): [—€,€] + G of X through e. By Propo- 
sition 15.5, p@,(e) = ;(p) is an integral curve of X defined on [—€, €] through any 
point p € G. Let g = @¢(p) be the endpoint of @,(p) on [—e€,é€]. Then @,(q) is an 
integral curve of X defined on [—€,€]. The two integral curves @,(p) and @,(q) agree 
on their overlap and so the domain of @;(p) can be extended to [—€,2¢]. Thus, the 
domain of an integral curve of X through any point can always be extended an extra 
€ unit. By induction, the maximal integral curve of X through any point is defined 
for allt € R. 


It follows from this proposition that a left-invariant vector field X on a Lie group 
G has a global flow 9: Rx GG. 
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For a Lie group G there is also a notion of an exponential map. Unlike the expo- 
nential map of a connection, which is a priori defined only on a small neighborhood 
of 0 in the tangent space 7,M of each point p in a manifold with a connection, the 
exponential map for a Lie group is defined on the entire tangent space T.G, but only 
for the tangent space at the identity. When the Lie group has a bi-invariant Rieman- 
nian metric (Section 15.8), the exponential map for the Lie group coincides with the 
exponential map of the Riemannian connection. 

For X. € g := TG, denote by X the left-invariant vector field on G generated 
by X,. In this section we consider primarily integral curves starting at the identity e. 
To show the dependence on X, we write cy (t) = @;(e) for the integral curve through 
e of the left-invariant vector field X on G. 

If s is a real number, then 


d 
ack (st) =sck(st) (chain rule) 
= $X,y(s¢) (ex is an integral curve of X). 


This shows that cx(st) as a function of ¢ is an integral curve through e of the left- 
invariant vector field sX. Hence, 


cx (st) = Csx (t). (15.1) 


Definition 15.8. The exponential map for a Lie group G with Lie algebra g is the 
map exp: g — G defined by exp(X,) = cxy(1) for X, € g. 


To distinguish the exponential map of a connection from the exponential map for 
a Lie group, we denote the former as Exp, and the latter as exp. 


Proposition 15.9. (i) For X, € g, the integral curve starting at e of the left-invariant 
vector field X is exp(tXe) = cx (t) = ;(e). 

(ii) For X- € g and g €G, the integral curve starting at g of the left-invariant vector 
field X is gexp(tX,). 

(iii) For s,t € R and X. € g, exp ((s+1)X-) = (expsXe)(exprXe). 

(iv) The exponential map exp: g — G is aC™ map. 

(v) The differential at 0 of the exponential map, exp, 9: To(g) = 9 > TeG = g is the 
identity map. 

(vi) For the general linear group GL(n,R), 


expA= a for A € gl(n,R). 
k=0 © 


Proof. (i) By the definition of exp and by (15.1), 
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(ii) This follows from (i) and Proposition 15.5. 
(iii) View s as a constant and differentiate with respect to f: 


d 
Hex OS +8) = x(S +1) = Keg (sts): 
This shows that cx (s-+t) as a function of t is an integral curve with initial point cx (s) 
of the left-invariant vector field X. With s fixed, cy (s)cx(t) is also an integral curve 
of X with initial point cy(s). By the uniqueness of integral curves, 


cx(s+t) =cx(s)cx(t). 


Using (i), we can rewrite this equation as exp ((s+1)X.) = (exp sX,) (exp tX,). 
(iv) (following [22], p. 103) The proof is based on the fact from the theory of ordinary 
differential equations that the flow of a C™ vector field is C”. Recall that for each 
X. € g, X is the left-invariant vector field generated by X,. Define a vector field V on 
Gx g by 

V(g,x-) = (Xg,0) = (Cgx (Xe), 0). 
By (ii), the integral curve of V starting at (g, X.) is c(t) = (gexp tX.,X-). The global 
flow of V is 


o:Rx(Gxg) > Gxg, 
g(t, (g,Xe)) i (gexp tXe,Xe). 
Let 7: Gx g > G be the projection to the first factor. Then 
exp(Xe) = (1 @) (1, (e,Xe)). 


As a composite of C” maps, exp: g > Gis C™. 
(v) For X. € g, a C® curve starting at 0 in g with initial vector X, is c(t) = tXe. 
Computing the differential exp, ) using this curve, we get by (i) that 


exp(tX,) = Xe. 
t=0 


exp,.o(Xe) 


d 
exp c(t) = Ai 


dt |,_9 


(vi) We know from ([21], Sect. 15.3, p. 170) that c(t) = Deo Atk /k! is absolutely 
convergent for A € gl(n,IR). Denote by A the left-invariant vector field on GL(n, R) 
generated by A. By [21, Example 8.19], 


Ag = (lg)xA = 8A. 


Now c(0) = /, the identity matrix, and 


122 815 Exponential Maps 


Hence c(t) is the integral curve of A through /. By (i), 


Setting t = 1 gives the desired identity. 


Definition 15.10. A 1-parameter subgroup of a Lie group G is a group homomor- 
phism g: RG. 


In this terminology, Proposition 15.9(i11) says that if X is a left-invariant vector 
field on a Lie group G, then its integral curve cy (t) = exp(tX.) through the identity 
e is a 1-parameter subgroup of G. 


Corollary 15.11. The diffeomorphism @,: G — G induced from the flow of a left- 
invariant vector field X on a Lie group G is right multiplication by exp tX-¢: 


1 (8) = exp 1x. (8). 


Proof. Applying Corollary 15.6 to e, we get 


(8) = Pr (ge) = 8 (€) = Tg,(c) (8) = Fexprx, (8). 


15.6 Naturality of the Exponential Map for a Lie Group 


Just as the exponential map for a Riemannian manifold is natural with respect to 
isometries, so the exponential map for a Lie group is natural with respect to Lie 
group homomorphisms. 


Theorem 15.12. Let H and G be Lie groups with Lie algebras h and g, respectively. 
If f: H — Gis a Lie group homomorphism, then the diagram 


) 9 
exp | | exp 


ts 


commutes. 
Lemma 15.13. /f f: H — Gis a group homomorphism, then for any h € H, 

foln = Lyn) of. (15.2) 
Proof (of lemma). For x € H, 


(fo ln)(x) = flhx) = FA) F(a) = (Cpt 2 f))- 
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Proof (of theorem). For A € b, denote by A the left-invariant vector field on the Lie 
group H generated by A. If c(t) is an integral curve of A through e, then 


e' (0) =A = (C5) 9A (15.3) 


We will now show that (f © c)(t) is an integral curve of f,A, the left-invariant 
vector field on G generated by fA € g. Taking the differential of (15.2), we get 


fe 0 Ly = Crh) o fx. (15.4) 
Then 
(foe! OS fhe (by the chain rule again) 


= fle(t)eA (by (15.3)) 
= £F(c(t))x(feA) (by (15.4)) 


= SrA (foc) (t): 


By the definition of the exponential map for a Lie group, 


exp(frA) = (fe c)(1) = f(c(1)) = flexpa). 


15.7 Adjoint Representation 


Let G be a Lie group with Lie algebra g. For g € G, define cg: G — G to be conju- 
gation by g: 
Ce(x) = gxg 


The differential at the identity of cg is denoted by 


Ad(g) = (Cg)se! 9 9: 

For g,hEG, 
Ceh = Cg 0 Ch, 
so that by the chain rule, 
(Cen) = (Cg) e (ch)«, 

or 

Ad(gh) = Ad(g) o Ad(h). 
Thus, Ad: G — GL(g) is a group homomorphism; it is called the adjoint represen- 
tation of the Lie group G. 


Proposition 15.14. Let G be a Lie group. The adjoint representation Ad: G + 
GL(g) is aC” map. 
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Proof. By the definition of a Lie group, the Lie group multiplication u: Gx G— G 
and inversion g++ g~! are C® maps. In terms of LL, 


GA) = i alae), gx EG, 
which shows that as a function of g and x, the conjugation cg(x) is also C®. 
Let x!,...,x" be a coordinate system in a neighborhood of the identity e. Then 
Ad(g) = (cg)«, is represented by the Jacobian matrix [(0(x! o cg)/Ax/)(e)]. Since all 
the partial derivatives 0 (x! 0 cg)/Ax/(e) are C* in g, Ad(g) is a C® function of g. 


The differential of Ad at the identity is a Lie algebra homomorphism 
ad := (Ad)x¢: g > gl(g), 
called the adjoint representation of the Lie algebra g. We usually write ad(A)(B) as 
ad, B. 
Proposition 15.15. For A,B € g, ad, B = [A,B]. 


Proof. For A € g, let A be the left-invariant vector field generated by A. We will write 
the exponential exp(A) as e“. By Proposition 15.9(i), the integral curve through e of 
A is @;,(e) = e'4, and by Corollary 15.11 the diffeomorphism 9,: G — G induced 
from the flow of A is right multiplication by e'4. For A,B € g, 


ad, B = (Ad,.-A)(B) (definition of ad) 
d tA : : tA 
eri Ad(e)(B) (computing Ad, using the curve @;(e) = e”) 
t=0 
d 3 
=—|  (c,a), (B) (definition of Ad) 
dt t=0 
d oye . : tA 
ier ; (r,-1a), (C4), B (definition of conjugation c(e’)) 
t= 
d ~ “3 * 
er (r.-14), Bota (definition of B) 
t=0 
d B — tA 
= dt (9-1) (Baa) (@ = A, Or(e) =e ) 
t=0 
= (£;8). (definition of the Lie derivative £;B) 
= [A,B]. ({21, Th. 20.4, p. 225]) 
= [A,B] (definition of [A, B]). 


15.8 Associativity of a Bi-Invariant Metric on a Lie Group 


A Riemannian metric ( , ) on a Lie group G is said to be left-invariant if for all C* 
vector fields X,Y on G and g € G, 

(CgxX ,lgx¥) = (X,Y). 
A right-invariant metric is defined similarly with rz, in place of ¢,. A metric that is 
both left- and right-invariant is said to be bi-invariant. 
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Proposition 15.16. [f ( , ) is a bi-invariant metric on a Lie group G, then for all 
left-invariant vector fields X ,Y,Z on G, 


([X,¥],Z) = (X, [¥,Z]). 
Proof. Since ( , ) is bi-invariant, for any g € G and X.,Z, € g, 
(X.,Z-) = (CgaF g-14Xey l gal g-14Ze) 
= ((Adg)Xe, (Adg)Ze). 
If c(t) is the integral curve through e of the left-invariant vector field Y, then 
(X.,Ze) = ((Adc(t)) Xe, (Adc(t))Z.) for all t. (15.5) 
Differentiating (15.5) with respect to ¢ and evaluating at t = 0 gives 


d 

dt t=0 
= ((Ad,c!(0)) Xe, (Ade(0)) Ze) + ((Adc(0)) Xe, (Ads c/(0))Ze) 
= (ady, Xe,Ze) Tr (Xe, ady, Ze} 

= ([Yo,Xe],Ze) + (Xe, [¥e,Ze]) (by Proposition 15.15) 

= —([Xe,¥e],Ze) + (Xe, [Ye,Ze]). 


0 ((Ade(t)) Xe, (Ade(t)) Ze) 


Since (, ) is left-invariant and 0, ..[Xe, Yo] = [lg «(Xe), ly.%(Ye)| = [Xe, Ye] for all g € G 
(see [21, Proposition 16.14, p. 185]), left-translating the equation above by g gives 
the proposition. 


Problems 


15.1. Exponential maps on R? 
Give R? its usual Euclidean metric. 


(a) At any point p € R?, the tangent space TpR? can be canonically identified with R?. Show 
that under this identification TR? ~ R’, the exponential map Exp Pe TpR? — R? is the 
identity map. 

(b) Viewed as a Lie group under addition, R? has Lie algebra R?. Show that the exponential 
map exp: R? > R? is also the identity map. 


15.2. Exponentials as generators 
Prove that a connected Lie group G with Lie algebra g is generated by expA for all A € g; i-e., 
every element g € Gis a product of finitely many exponentials: g = (expA,)---(expA,). 


15.3. Invariant Riemannian metrics on a Lie group 
Let G be a Lie group and g its Lie algebra. Suppose ( , )¢ is an inner product on g. For 
any g € Gand X,,¥, € T,G, define 


Y= C4 baie 
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(a) Show that ( , ) is a left-invariant Riemannian metric on G. 
(b) Show that this sets up a bijection 


{inner products on g} <> {left-invariant Riemannian metrics on G}. 


(c) Show that under the bijection in (b), Ad(G)-invariant inner products on g correspond to 
bi-invariant Riemannian metrics on G. 


15.4. Riemannian connection on a Lie group with a bi-invariant metric 

Let G be a Lie group with a bi-invariant Riemannian metric. Prove that if V is its Rieman- 
nian connection, then VyY = 5X ,Y] for all left-invariant vector fields X and Y on G. (Hint: 
Use (6.8) and Proposition 15.16.) 


15.5. Geodesics on a Lie group with a bi-invariant metric 


(a) Let G be a Lie group with a bi-invariant Riemannian metric. Prove that the geodesics on G 
are precisely the integral curves of left-invariant vector fields. (Hint: Let c(t) be an integral 
curve through e of a left-invariant vector field X. Apply Problem 15.4 to show that c(t) is 
a geodesic.) 

(b) Show that if a Lie group has a bi-invariant Riemannian metric, then the exponential map 
for the Lie group coincides with the exponential map Exp, of the Riemannian connection. 


15.9 Addendum. The Exponential Map as a Natural 
Transformation 


The naturality of the exponential map for a Riemannian manifold says in fact that 
the exponential map is a natural transformation between two functors. There are two 
functors lurking in the background in Theorem 15.2, and in this addendum we will 
ferret them out. 

Let F and G be two functors from a category A to a category B. A natural 
transformation from F to G is a collection of morphisms @4: F(A) — G(A) in B, 
one for each object A in A, such that for each morphism f: A — A’ in A the diagram 


F(4) 2. F(a) 
*| |r 
G(A) aan G(4’) 


is commutative. 

Let A be the category in which the objects are pointed Riemannian manifolds 
(M, p) and for any two objects (N,q) and (M, p), a morphism (N,q) > (M, p) is an 
isometry f: N — M such that f(q) = p. 

Next, we define the category B. Consider the set of all pairs (U, p), where U is 
an open subset of a manifold M and p € U. We say that two pairs (U, p) and (V,q) 
are equivalent, written (U, p) ~ (V,q), if p = q and U and V are open subsets of the 
same manifold. This is clearly an equivalence relation. An equivalence class of such 
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pairs (U, p) with U C M is called a germ of neighborhoods of p in M. A morphism 
f: [(U,p)] > [(V,q)] from the germ of (U, p) to the germ of (V,q) is a collection of 
C* maps fy yr: (U',p) — (V',q) such that 


(U',p) ~ (U,p), (V'.q)~(V,q), 


and for any overlapping (U’, p) and (U”, p) on which fy y: and fy” y are defined, 


Fv gisgH = fury" graye : 


This collection { fy y} need not be defined on all (U’, p) € [(U, p)]; it is enough that 
fu.y' be defined on some neighborhood U’ of p in M. As the category B, we take 
the objects to be all germs of neighborhoods and the morphisms to be morphisms of 
germs of neighborhoods. 

The functor F: A — B takes a pointed Riemannian manifold (M, p) to the germ 
[(7,M,0)| of neighborhoods of the origin in the tangent space 7,,M, and takes an 
isometry f: (N,q) > (M,p) of pointed Riemannian manifolds to the morphism of 
germs of neighborhoods induced by the differential f, 4: (T;N,0) + (7)M,0). 

The functor G: A — 8 takes a pointed Riemannian manifold (M, p) to its germ 
[((M, p)] of neighborhoods, and takes an isometry f: (N,q) — (M,p) to the mor- 
phism of germs of neighborhoods induced by /. 

The exponential map Exp, of a Riemannian manifold N at g induces a morphism 
Exp, : [(Z;N,0)| + [(N,q)] of germs of neighborhoods. In the language of categories 
and functors, Theorem 15.2 is equivalent to the statement that the exponential map 
is a natural transformation from the functor F to the functor G. 
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$16 Distance and Volume 


In differential topology, where two manifolds are considered the same if they are 
diffeomorphic, the concepts of length and volume do not make sense, for length 
and volume are clearly not invariant under diffeomorphisms. In differential geome- 
try, where two Riemannian manifolds are considered the same if they are isometric, 
length and volume do make sense, for they are invariant under isometries. In this 
chapter we define length and distance on a Riemannian manifold and construct a 
volume form on an oriented Riemannian manifold. 


16.1 Distance in a Riemannian Manifold 


In a Riemannian manifold we defined the length of a tangent vector X, € T,M to be 


I|Xp|| =V/ (Xp,Xp). 


If c: [a,b] + M is a parametrized curve in M, its arc length is given by 


«= [leolet 


We can reparametrize c(t) via a diffeomorphism t(u): [a1,b1] — [a,b]; the 
reparametrization of c(t) is then c(t(u)): [a,,b1] + M. We say that the reparametriza- 
tion is orientation-preserving if it preserves the order of the endpoints: t(a,) =a 
and t(b,) = b; it is orientation-reversing if it reverses the order of the endpoints. 
Since a diffeomorphism t(u): [a1,b1] — [a,b] is one-to-one, it is either increasing or 
decreasing. Let u(t) be its inverse. Because u(t(u)) = u, by the chain rule, 


which shows that dt /du is never zero. So dt/du is either always positive or always 
negative, depending on whether the reparametrization t(u) preserves or reverses the 
orientation. 


Proposition 16.1. The arc length of a curve c: [a,b] > M in a Riemannian manifold 
M is independent of its parametrization. 


Proof. Suppose c(t(u)),a; <u < by is an orientation-preserving reparametrization. 
Then dt/du > 0. By the chain rule, 


d ' dt 
mot) =e(t(u))F. (16.1) 


It follows that 
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—_c(t(u)) 


[ 
aj 


er} 


au = | "jet onsl 


= ee u))|| od 


a| 
= 7 \|c(¢)||dt (change of variables formula). 
a 


If c(t(u)) is an orientation-reversing parametrization, the calculation is the same 
except that (i) |dt /du| = —dt /du, because dt /du < 0, and (ii) the limits of integration 
in the last line are reversed. These two changes cancel out: 

bill d yy dt 
—c(t du = | ‘(t — 

[| Zete|au= [re oan |4 
dt dt 


= -[ IIc’ (¢(u))|| ant _ [ IIc’ (t(u)) | Tn 


b 
= / \|c’(¢)||dt (change of variables formula). 
a 


So the arc length is still independent of the parametrization. 


Given two points p and q on a connected Riemannian manifold, we define the 
distance between them to be 


d(p,q) = inf ¢(c), 


where the infimum is taken over all piecewise smooth curves c from p to g and &(c) 
is the length of the curve c. 

For the distance d(p,q) to be defined, there must be a curve joining p and gq. 
Hence, in order to have a distance function, the Riemannian manifold should be 
path-connected. By a theorem of general topology, a locally path-connected space is 
path-connected if and only if it is connected (Problem A.2). Being locally Euclidean, 
a manifold is locally path-connected. Thus, for a manifold, path-connectedness is 
equivalent to connectedness. Whenever we speak of the distance function on a Rie- 
mannian manifold M, we will assume M to be connected. 

It is easily verified that the distance function on a Riemannian manifold satisfies 
the three defining properties of a metric on a metric space: for all p,q,r € M, 


(i) positive-definiteness: d(p,q) > 0 and equality holds if and only if p = q; 
(ii) symmetry: d(p,q) = d(q, p): 
(iii) triangle inequality: d(p,r) < d(p,q) +4d(q,r). 
Therefore, with the distance as a metric, the Riemannian manifold M becomes a 
metric space. 


Remark 16.2. This metric is not the Riemannian metric, which is an inner product on 
the tangent space 7,,M at each point p of the manifold. In the literature a “Rieman- 
nian metric” is sometimes shortened to a “metric.” So the word “metric” can have 
two different meanings in Riemannian geometry. 
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Example 16.3. Let M be the punctured plane R* — {(0,0)} with the Euclidean metric 
as the Riemannian metric. There is no geodesic from (—1,—1) to (1,1), because 
the origin is missing from the manifold, but the distance between the two points is 
defined and is equal to 2,\/2 (Figure 16.1). This example shows that the distance 
between two points need not be realized by a geodesic. 


(1,1) 


(-1,-1) 


Fig. 16.1. Punctured plane. 


16.2 Geodesic Completeness 


In differential topology, a curve c: (a,b) + M ina manifold M can be reparametrized 
so that its domain is R, because the interval (a,b) is diffeomorphic to R. Now sup- 
pose the manifold M is Riemannian and the curve is a geodesic. By Proposition 14.5, 
any reparametrization of the geodesic which keeps it a geodesic must have the form 
t = au+f, for a,B €R. Such a linear change of variables maps a finite interval to 
another finite interval. Thus, whether or not a geodesic has the entire real line R as 
its domain is independent of the parametrization. 


Definition 16.4. A Riemannian manifold is said to be geodesically complete if the 
domain of every geodesic in it can be extended to R. 
Example. The Euclidean space R” is geodesically complete. 


Example. The sphere S? is geodesically complete. 


Example. The punctured Euclidean plane R* — {(0,0)} is not geodesically com- 
plete, since the domain of the geodesic c(t) = (t,t), 1 <t < ce cannot be extended to 
—oo < ft <9, 


Definition 16.5. A geodesic defined on [a,b] is said to be minimal if its length is 
minimal among all piecewise smooth curves joining its two endpoints. 


A proof of the following fundamental theorem on geodesic completeness may be 
found in [19, Volume 1, Theorem 18, p. 342]. 
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Heinz Hopf (1894-1971) and Willi Rinow (1907-1979) 


Theorem 16.6 (Hopf—Rinow theorem). A Riemannian manifold is geodesically 
complete if and only if it is complete as a metric space in the distance metric d 
defined above. Moreover, in a geodesically complete Riemannian manifold, any two 
points may be joined by a minimal geodesic. 


16.3 Dual 1-Forms Under a Change of Frame 


Many of the constructions in differential geometry are local, in terms of a frame 
of vector fields and its dual frame of 1-forms, for example, the connection matrix 
and the curvature matrix of an affine connection. To see how the construction trans- 
forms under a change of frame, it is useful to know how the dual frame of 1-forms 
transforms. 

If e1,...,@, and @,,...,@, are two C™ frames on an open set U in a manifold M, 
then 


= Laei (16.2) 
I 


for some C™ function [a‘]: U — GL(n,R). Let g!,...,0" and 9!,...,0” be the dual 
frames of 1-forms, respectively, meaning 


6'(e))=5; and 6'(2;)=5i, 


where 5 is the Kronecker delta. To write 6!,...,6” in terms of @!,...,0”, it is best 
to use matrix notation. 
In matrix notation we write the frame e1,...,é@, aS a row vector 


e=[e, --- ey] 
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and the dual frame @!,...,@” as a column vector 
g! 
@=]|: 
Q” 
The duality is expressed in matrix notation as a matrix product: 
g! 
Oe= | : | [er ++ en) =[6'(e;)] = [6)] =7. 
(a 
Equation (16.2) translates into 
é= (@ --- &) = [e, --- en|la’)] = eA. 
Proposition 16.7. Let e = |e, --- e,| and é= [é, --- &,] be two frames on an open 


set U. If €=eA for A: U + GL(n,R), then the dual frames 6 and 0 are related by 
6=A-'9. 


What this means is that if 6’ = dbo, then B= [>'] =Ap 


Proof. Since 
(A-1@)e¢=Al@eA=ATIA=I, 


we have 6 =A~'@. 


Next we study how 6! --- 6” is related to 0! A --- A 0”. 


Proposition 16.8 (Wedge product under a change of frame). Let V be a vector 
space of dimension n and 6',...,0"a basis for the dual space VY. If the 1-covectors 
6!,..., 0" and @',...,0” are related by 9' = dbie/, then with B = [b', 


6! A---\6" = (detB)a!A---A@". 


Proof. Problem 16.3. 


16.4 Volume Form 


On an oriented manifold of dimension n there are many nowhere-vanishing n-forms. 
If @ is a nowhere-vanishing n-form, so is any nonzero multiple of @. In general, it is 
not possible to single out one of them as the volume form. If the oriented manifold is 
Riemannian, however, we will show that there is a canonically defined volume form. 

Let (U,x!,...,x") be an arbitrary chart on an oriented Riemannian manifold M. 
Applying the Gram—Schmidt process to the coordinate frame 01,...,0n, we can con- 
struct an orthonormal frame e1,...,@, on U. Let @!,...,0” be the dual frame of 
1-forms and define 

@=0'/A---A6". 


Then @ is a nowhere-vanishing n-form on U. 
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To see how @ depends on the choice of an orthonormal frame, let @;,...,@, be 
another orthonormal frame on U. Then there are C™ functions a’ : U > R such that 


2) = Yiaie;. (16.3) 


Since both e1,...,é@, and @),...,@, are orthonormal, the change of basis matrix A = 
[a' is a function from U to the orthogonal group O(n). Let 6!,...,6” be the dual 
frame of @1,...,@. By Proposition 16.7, 6! = D(a-'), a. It then follows from 
Proposition 16.8 that 


6'A---AG" =det(A)alA---A 0". 


We now assume that e and @ are both positively oriented. Then the matrix function 
A: U -+ SO(n) assumes values in the special orthogonal group. Since A~! € SO(n), 
det(A~!) = 1, and so 

61 A---A6" =O! A.A 8". 


This proves that a = 6! A --- A @” is independent of the choice of the positively 
oriented orthonormal frame. It is a canonically defined n-form on an oriented Rie- 
mannian manifold M. We call @ the volume form of M. In case the integral fy, @ is 
finite, we call f,, @ the volume of the oriented Riemannian manifold M. 


Example 16.9 (Volume form on R?). On R?, an orthonormal basis is 0/dx,0/dy, 
with dual basis 9! = dx, @? = dy. Hence, the volume form on R? is dx A dy. 


Example 16.10 (Volume form on TH’). By Section 12.5, an orthonormal frame on the 
Poincaré half-plane Hi? is e; = yO /dx, e2 = yd /dy, with dual frame 6! = (dx)/y, 
6? = (dy) /y. Hence, the volume form on H? is dx A dy/y’. 


The notion of a tangent bundle extends to a manifold M with boundary 0M (see 
(21, Section 22.4, pp. 253—254]); hence so does the notion of a Riemannian metric: 
a Riemannian metric on a manifold M with boundary is a positive-definite symmet- 
ric bilinear form on the tangent bundle TM. If M is a Riemannian manifold with 
boundary, then 0M inherits naturally a Riemannian metric from M. 


Theorem 16.11. Let voly and volgy be the volume forms on an oriented Riemannian 
manifold M and on its boundary 0M. Assume that the boundary is given the 
boundary orientation [21, §22.6]. If X is the outward unit normal along 0M, then 
volgy = tx(voly), where tx is the interior multiplication with X. (For the basic 
properties of interior multiplication, see [21, Section 20.4, pp. 227—229].) 


Proof. Let p be a point in the boundary 0M. If (X,e2,...,n) is a positively oriented 
orthonormal frame field for M in a neighborhood U of p, then by the definition of 
the boundary orientation, (e2,...,@,) is a positively oriented orthonormal frame for 
OM in UM OM. Let 6!,...,0” be the dual frame to X,e2,...,@,. Clearly, 07,...,0” 
is dual to e2,...,e@, on UM OM. So 


voly = 0!A---A@" and volay=07A---A@". 


134 §16 Distance and Volume 
At a point of 0M, 
ly (voly) = tx ped 
= LC ir '9i(x ) OA ABTA AO" 


6) (X) @7A---A@" 
=07A---A0" =volgy. 


Example 16.12 (The volume form on a circle). The volume form on the unit disk 
D={(x,y) €R? |x? +y’ <1} 
is dx \ dy. The unit outward normal X along its boundary 0D = S! is 


re) 0 


By Theorem 16.11, the volume form on the unit circle oriented counterclockwise is 


@s1 = lx (dx A dy) = (txdx)dy — dx(txdy) 
= (Xx) dy—dx(Xy) = xdy—ydx. 


16.5 The Volume Form in Local Coordinates 


On a coordinate chart (U Ce ccs ,x") of a Riemannian manifold M, the volume form 
voly, being a nowhere-vanishing top form, is a multiple fdx! \--- dx" of dx! A 

-- A dx" for some nonvanishing function f on U. The next theorem determines f in 
terms of the Riemannian metric. 


Theorem 16.13. Let (U,x!,...,x") be a coordinate chart on a Riemannian manifold 
M, and let g;; = (0;,0;) = (0/dx',d/dx/). Then the volume form of M on U is 
given by 


voly = \/detg dx! A--- Adx", 
where g is the matrix |g;/|. 


Proof. Let e1,...,@, be an orthonormal frame on U, with dual frame of 1-forms 
6',...,0". Then dj = dia .e; for some matrix-valued function A = [a a’: U> 


GL(n,R). The dual frame to 0,...,0) is dx!,...,dx". By Proposition 16.7, 
dx' =; bio! with B= [>'] = A7!. By Proposition 16.8, 


dx! \.--\dx" = (detB)@!A---A@". 
Hence, 


voly = 0! A---A@" = (detB) !dx! A---Adx" 
= (detA)dx' A---Adx". 


16.5 The Volume Form in Local Coordinates 135 
To find detA, note that 


&ij = (0;,0;) = (Y afer, a‘er) = ¥ aia’ (ex, e0) 
= ¥ aha’ Sec = Yafat = (ATA); 
k 


Therefore, 
g= [gi] =A™A, 
so that 
det g = det(A’A) = (detA)”. 
It follows that 
detA = ,/detg 


and 


voly = (detA)dx! A--»Adx" = \/detg dx! A--- dx". 


Problems 


16.1. Transition matrix for orthonormal bases 
Suppose e1,...,@, and @),...,@, are two orthonormal bases in an inner product space V. Prove 


that if 2; = Layer then the matrix A = [a‘] is orthogonal, i.e., A7A = I. 


16.2. The triangle inequality 
Prove the triangle inequality for the distance function on a Riemannian manifold M: for all 


P.areM, 
d(p,r) < d(p,q) +d(q,r). 


16.3. Wedge product under a change of frame 
Prove Proposition 16.8. 


16.4. Volume form of a sphere in Cartesian coordinates 
Let S”~!(a) be the sphere of radius a centered at the origin in R”. Orient S”~!(a) as the 


boundary of the solid ball of radius a. Prove that if x!,...,x” are the Cartesian coordinates on 
R", then the volume form on S”~!(a) is 
1 n . . —~ 
Vol gn-1(q) = = Y(—1)1x'dx! A+++ Adxt A+++ A dx". 
4 j=] 


(Hint: Instead of finding an orthonormal frame on the sphere, use Theorem 16.11.) 


16.5. Volume form of a 2-sphere in spherical coordinates 

Parametrize the 2-sphere M of radius a in R? by spherical coordinates @,@, where @ is the 
angle in the (x,y)-plane relative to the positive x-axis and @ is the angle relative to the positive 
z-axis (Figure 16.2). Then 


x asin @ cos @ 
y asing sin@ | , 0<0<2n, 0<¢<Z. 
Zz acos@ 
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Fig. 16.2. Spherical coordinates in R?. 


(a) Compute 0/0 and 0/00 and show that they are orthogonal to each other. (Hint: Use 
[21, Prop. 8.10, p. 90] to write 0/d@ and 0/08 in terms of 0/dx, 0/dy, 0/0z.) 

(b) Let e; = (0/06) /||0/06|, ex = (0/00)/||0/6||. Calculate the dual basis a!, a? of 
1-forms and the volume form on the 2-sphere of radius a in terms of a,@, 0. 

(c) Calculate the volume of M. (Since M is a surface, by its volume we mean its surface area.) 


16.6. Volume form of an n-sphere in spherical coordinates 
For n > 3 we define the spherical coordinates on R” as follows (see Figure 16.3). Let rz, be 
the distance of the point (x;,...,x,) from the origin in R*: 


re = fx te $xg. 


(In this problem we use subscripts instead of superscripts on coordinate functions so that a 
means the square of x;.) The spherical coordinates on IR? are the usual polar coordinates 
r=r2,0,0< @ < 22. Forn > 3, if x = (x1,...,%n), then the angle @, is the angle the vector 
x makes relative to the x,-axis; it is determined uniquely by the formula 
x, 
COS Pn = =, O< dr <a. 
r, 


n 


Project x to R’~! along the x,-axis. By induction the spherical coordinates r,_1,0, $3, ..-, 
on—1 Of the projection (x1,...,X%,—1) in R”—! are defined. Then the spherical coordinates 
of (x1,..-,4n) in R” are defined to be rn, 0,03,..-, 9-1, n-. Thus for k = 3,...,n, we have 
COS Oy = xx/Tg. Let S°—!(a) be the sphere r, = a. 


(a) Find the length of the vectors 0/00, 0/0, for k =3,...,n. 

(b) Normalize the vectors in part (a) to obtain an orthonormal frame e2,...,é, on the sphere 
sre 1 ( a) ; 

(c) Give the sphere sno (a) the boundary orientation of the closed solid ball of radius a. (For 
a discussion of boundary orientation, see [21, Section 21.6, p. 255].) Show that the volume 
form on S’~! (a) in spherical coordinates is up to sign 


o =a"! (sin”? dn)(sin” > dn—1) +++ (sings) dO A do3 \--- Addn. 


16.7. Surface area of a sphere 
Because the “volume of a sphere” usually means the volume of the ball enclosed by the sphere, 
we will call the integral of the volume form on a sphere the surface area of the sphere. 
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Fig. 16.3. Spherical coordinates in R”. 


(a) By integration by parts, show that 
1 -_ 1 1 
[ sin” odo =" ‘i sin”? dd. 
0 n 0 


(b) Give a numerical expression for Io sin2* odo and So sin?! odd. 
(c) Compute the surface area of a sphere of radius a. Treat the even-dimensional case and the 
odd-dimensional case separately. 


16.8. Volume form on a smooth hypersurface in R” 

Let f: R” — R be a C® function whose partial derivatives 0 f/x! are not simultaneously 
zero at any point of the zero locus Z(f) of f. By the regular level set theorem, Z(f) is a 
smooth manifold of dimension n— 1 (see [21, Th. 9.9, p. 105]). It inherits a Riemannian 
metric from R”. 


(a) Show that the gradient vector field grad f = (Of /0x')0/0x! is a nowhere-vanishing 
normal vector field along Z(f). 

(b) The unit normal vector field X = grad f/||grad f|| on Z(f) defines an orientation on Z(f): 
an orthonormal frame (e3,...,@,) at p € Z(f) is said to be positive for T,(Z(f)) if and 
only if (X,e2,...,€n) is positive for T,(IR”). Compute the volume form on Z(f) with this 
orientation. 
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$17 The Gauss—Bonnet Theorem 


According to an elementary theorem in Euclidean geometry, the three angles of a 
triangle in the plane add up to 180°. This simple fact has a beautiful generalization 
to any polygon on a Riemannian 2-manifold. The formula, called the Gauss—Bonnet 
formula for a polygon, relates the sum of the angles of a polygon to the geodesic 
curvature of the edges of the polygon and the Gaussian curvature of the surface. 
As a corollary, the angles of a geodesic triangle on a Riemannian 2-manifold can 
add up to more or less than 180°, depending on whether the Gaussian curvature is 
everywhere positive or everywhere negative. 

The Gauss—Bonnet formula for a polygon leads to the Gauss—Bonnet theorem 
for an oriented Riemannian 2-manifold, according to which the total Gaussian cur- 
vature f;, K vol of an oriented Riemannian 2-manifold M is equal to 27 times the 
Euler characteristic y¥(M). The most striking feature of this theorem is that while the 
Gaussian curvature K depends on the Riemannian metric, the Euler characteristic 
does not. Thus, although the Gaussian curvature depends on the metric, its integral 
does not. Read another way, the Gauss—Bonnet theorem also implies that although 
the Euler characteristic by definition depends on a polygonal decomposition of the 
surface, because the total curvature does not, the Euler characteristic is in fact inde- 
pendent of the polygonal decomposition. 

It follows that two homeomorphic compact orientable surfaces have the same 
Euler characteristic, since a homeomorphism carries a polygonal decomposition 
from one surface to the other and preserves the number of vertices, edges, and faces. 
Thus, at least for compact orientable surfaces, the Euler characteristic is a topologi- 
cal invariant. It is the first instance of a topological invariant that can be constructed 
using curvature. 


17.1 Geodesic Curvature 


In the same way the curvature of a curve in Euclidean space is defined using the 
usual derivative (Problem 2.7), we can define the geodesic curvature of a curve in a 
Riemannian manifold using the covariant derivative along the curve. 

Consider a unit-speed curve y(s): [a,b] > M in a Riemannian manifold M. The 
Riemannian connection of M induces a covariant derivative D/ds along the curve. 
Let T(s) = y(s). The curve y(s) is a geodesic if and only if DT /ds vanishes iden- 
tically. Thus, the magnitude ||DT /ds|| gives a measure of the extent to which y(s) 
fails to be a geodesic. It is called the geodesic curvature K, of the curve y(s): 


_ DT 
“eas 


So defined, the geodesic curvature is always nonnegative. 
It follows directly from the definition that a unit-speed curve is a geodesic if and 
only if its geodesic curvature is zero. 
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17.2 The Angle Function Along a Curve 


Ata point p of an oriented Riemannian manifold M, the angle ¢ between two vectors 
u and v in the tangent space T,,M is given by the formula 
cos = ae 
Ile [I 
In general, the angle € is defined only up to an integer multiple of 27. 

Suppose now that (U,e1,e2) is a framed open set on which there is a positively 
oriented orthonormal frame e1,e2. If c: [a,b] > U is a C® curve, let ¢(t) be the 
angle from ej.) to c/(t) in T.~)M. Because C(t) is defined only up to an integer 
multiple of 27, ¢ is a function from [a,b] to R/27Z. It is a C* function, since it is 


locally a branch of cos~! (c'(t),€1,<@))/Ile’(t)||- Let Co be a real number such that 


(c'(a), €1c(a)) 

Ile") 
Since R is a covering space of R/27Z and the interval [a, b] is simply connected, we 
know from the theory of covering spaces that there is a unique C* map €: [a,b] > R 
with a specified initial value C (a) = Co that covers ¢: 


cos Cy = 


[4,6] —> R/2nZ, 


We call ¢: [a,b] > R the angle function with initial value Co along the curve c rela- 
tive to the frame e;,e2. Since an angle function along a curve is uniquely determined 
by its initial value, any two angle functions along the curve c relative to e),e2 differ 
by a constant integer multiple of 27. 


17.3 Signed Geodesic Curvature on an Oriented Surface 


Mimicking the definition of the signed curvature for a plane curve, we can give 
the geodesic curvature a sign on an oriented Riemannian 2-manifold. Given a unit- 
speed curve y(s): [a,b] + M on an oriented Riemannian 2-manifold M, choose a 
unit vector field n along the curve so that 7,n is positively oriented and orthonormal. 
Since (T,T) = 1 on the curve, 


d 
—(T,T) =0. 
<(1,T) =0 


DT 
2(—.T)=0. 
aT) 


Since the tangent space of M at y(s) is 2-dimensional and DT /ds is perpendicular to 
T, DT /ds must be a scalar multiple of n. 


By Theorem 13.2, 
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Definition 17.1. The signed geodesic curvature at a point y(s) of a unit-speed curve 
in an oriented Riemannian 2-manifold is the number k,(s) such that 


DT _ 


—— = k,n. 
ds e 


The signed geodesic curvature Ky can also be computed as 


DT 
oe a Pe ae 


Let U be an open subset of the oriented Riemannian 2-manifold M such that there 
is an orthonormal frame e;,e2 on U. We assume that the frame e1,e2 is positively 
oriented. Suppose y: [a,b] + U is a unit-speed curve. Let €: [a,b] + U be an angle 
function along the curve y relative to e),e2. Thus, ¢(s) is the angle that the velocity 
vector T(s) makes relative to e; at y(s) (Figure 17.1). In terms of the angle ¢, 

T= (cos€)e, +(sin€)eo, (17.1) 
n= —(sin€)e; + (cos€)ez, (17.2) 


where e}, 2 are evaluated at c(t). 


e} 


Fig. 17.1. The angle ¢. 


Proposition 17.2. Let w} be the connection form of an affine connection on a Rie- 
mannian 2-manifold relative to the positively oriented orthonormal frame e,,e2 
on U. Then the signed geodesic curvature of the unit-speed curve y is given by 


Proof. Differentiating (17.1) with respect to s gives 


DT d D d D 
a (<c0s6) e1 + (cos) * + (sing) en + (sing). 
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In this sum, e; really means e; .(;) and Dei eq) /ds = Vre; by Theorem 13.1 (iii), so 
that by Proposition 11.4, 


D 
7 =Vre) = w(T)e, =—@l(T)en, 
=Vrez = 03 (T)e 
s 
Hence, 
DT ‘ d 
a (sin ¢) —e; — (cos C)ah (T)e2 
Ss ds 
+ (cos) 5 -e2 + (sing); (T)e1 
d 
= ( — 4 (r)) n 
So 
d 
Ky =F —oh(7) 


Since Ky is a C® function on the interval [a,b], it can be integrated. The integral 
( Kg ds is called the total geodesic curvature of the unit-speed curve y: [a,b] + M. 


Corollary 17.3. Let M be an oriented Riemannian 2-manifold. Assume that the 
image of the unit-speed curve y: |a,b] > M is a 1-dimensional submanifold C with 
boundary. If C lies in an open set U with positively oriented orthonormal frame e , ez 
and connection form 3, then its total geodesic curvature is 


b 
J xeds=6(0)-6(@)— [o}. 
a Cc 
Proof. Note that y~! : C — [a,b] is a coordinate map on C, so that 


b 
[o=[ rel. 
Cc a 


Let s be the coordinate on [a,b]. Then 


Yo; = f(s)ds (17.3) 


for some C® function f(s). To find f(s), apply both sides of (17.3) to d/ds: 


fs) = (ro) (Z) = 0! (22) = alts) = o(7. 
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Hence, 7 @; = @}(T)ds. By Proposition 17.2, 


=6(6)—£(a)— f ah¢ryas 


17.4 Gauss—Bonnet Formula for a Polygon 


A polygon on a surface M is a piecewise smooth simple closed curve y: [a,b] > M; 
here “closed” means that y(a) = y(b) and “simple” means that the curve has no other 
self-intersections. We say that a polygon is unit-speed if it has unit speed everywhere 
except at the nonsmooth points. Suppose now that M is an oriented Riemannian 2- 
manifold and that y: [a,b] — M is a unit-speed polygon that lies in an open set U 
on which there is an oriented orthonormal frame e,e2. Let (so), ¥(s1),---;¥(Sm) 
be the vertices of the curve, where a = s9,b = Sp, and y(a) = y(b). Denote by C 
the image of y, and by R the region enclosed by the curve C (Figure 17.2). Let K 
be the Gaussian curvature of the oriented Riemannian 2-manifold M. 

In a neighborhood of a vertex y(s;), the curve is the union of two 1-dimensional 
manifolds with boundary. Let y'(s; ) be the outward-pointing tangent vector at y(s;) 
of the incoming curve, and let y'(s;") be the inward-pointing tangent vector at y(s;) 
of the outgoing curve. The angle ¢; in | — 2,7/[ from the incoming tangent vector 
7 (s; ) to the outgoing tangent vector y'(s;") is called the jump angle at y(s;). The 
interior angle at y(s;) is defined to be 


B; [= 1 — &. 


Along each edge y([s;_1,5;]) of a polygon, the velocity vector T changes its angle 
by AG; = €(s;) — ¢(s;_1). At the vertex y(s;), the angle jumps by ¢€;. Thus, the total 
change in the angle of T around a polygon is 


SY AG +> g. 
i=l i=1 


In Figure 17.2, it appears that the total change in the angle is 27. This is the content 
of the Hopf Umlaufsatz (circulation theorem), which we state below. On an oriented 
surface, a polygon is positively oriented if it has the boundary orientation of the 
oriented region it encloses. In the plane, positive orientation is counterclockwise. 


Theorem 17.4 (Hopf Umlaufsatz). Let (U,e1,e2) be a framed open set on an ori- 
ented Riemannian 2-manifold, and y: [a,b] + U a positively oriented piecewise 
smooth simple closed curve. Then the total change in the angle of T(s) = y'(s) 
around ¥ is 
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Fig. 17.2. A polygonal region on a surface. 


m m 
YAG+ > e; =27. 
i=l i=l 


The Hopf Umlaufsatz is also called the rotation index theorem or the rotation 
angle theorem. A proof may be found in [11] or [16, Th. 2.9, pp. 56-57]. In the 
Addendum we give a proof for the case of a plane curve. 


Theorem 17.5 (Gauss—Bonnet formula for a polygon). Under the hypotheses 
above, 


b m 
| Keds =2n—Y6,— | Kvol. 
a i=l R 


Proof. The integral { Keds is the sum of the total geodesic curvature on each edge 
of the simple closed curve. As before, we denote by A¢; the change in the angle ¢ 
along the ith edge of y. By Corollary 17.3, 


1) m Sj 
i Kds= >, | Kk, ds 
a i=17Si-1 
= Dag | ol. (17.4) 
i Cc 


By the Hopf Umlaufsatz, 
SAG = 20 - VG. (17.5) 
Recall from Section 12 that on a framed open set (U,e1,e2) of a Riemannian 
2-manifold with orthonormal frame e;,e2 and dual frame @', 6”, if [Q'] is the curva- 
ture matrix relative to the frame e), e2, then 


Q} = da; = K 0! A 6”. 


Therefore, 


= i do} (by Stokes’ theorem) 
R 


= | Ko! ne? = | K vol. (17.6) 
R R 


Combining (17.4), (17.5), and (17.6) gives the Gauss—Bonnet formula for a polygon. 
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17.5 Triangles on a Riemannian 2-Manifold 


A geodesic polygon is a polygon whose segments are geodesics. We will deduce 
from the Gauss—Bonnet formula a famous result on the sum of the angles of a 
geodesic triangle on a Riemannian 2-manifold. 

Proposition 17.6. Suppose a geodesic m-gon ¥ lies in a framed open set of an ori- 
ented Riemannian 2-manifold. If B; is the interior angle at vertex i of the geodesic 
m-gon, R is the closed region enclosed by y, and K is the Gaussian curvature on R, 
then 


LB (m—2)e+ f K vol, 


Proof. Along a geodesic m-gon, the geodesic curvature Kk, is identically zero. The 
exterior angle €; at vertex i is 

€j = 1 — Bi. 
Hence, the Gauss—Bonnet formula becomes 


0=2n—Si(e—Bi)~ [Kol 


or 


dFi= (m—2)n-+ | Kvol. 


The following corollary is immediate. 


Corollary 17.7. Suppose a geodesic triangle lies in a framed open set of an oriented 
Riemannian 2-manifold M. 


(i) If M has zero Gaussian curvature, the angles of the geodesic triangle add up to 
7m radians. 
(ii) If M has everywhere positive Gaussian curvature, the angles of the geodesic 
triangle add up to more than 1 radians. 
(iii) If M has everywhere negative Gaussian curvature, the angles of the geodesic 
triangle add up to less than 7% radians. 


By Section 12.5, the Poincaré half-plane has constant Gaussian curvature —1. 
It is also known as the hyperbolic plane. We will call a geodesic triangle in the 
Poincaré half-plane a hyperbolic triangle. 


Corollary 17.8. The area enclosed by a hyperbolic triangle is m minus the sum of its 
interior angles. 


Proof. If R is the region enclosed by the hyperbolic triangle, then by Proposi- 
tion 17.6, 


Area(R) = i 1 vol = ~ [ Kvol 
R R 


3 
= m— > Bi. 
i=1 
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Let M be a compact oriented Riemannian 2-manifold. Suppose M has been cut up 
into polygonal regions, each of which is bounded by a piecewise smooth simple 
closed curve and lies in a framed open set. We call such a polygonal decomposition 
sufficiently fine. Let V,E, F be the number of vertices, edges, and faces, respectively, 
of this polygonal decomposition. The Euler characteristic of the decomposition is 
defined to be 

X=V-E+F. 


Theorem 17.9 (Gauss—Bonnet theorem). For a compact oriented Riemannian 2- 
manifold M, 


| K vol = 227(M). 
M 


Proof. Decompose the surface M into a sufficiently fine polygonal decomposition. 
The orientation on the surface will orient each face in such a way that each edge is 
oriented in two opposite ways, depending on which of the two adjacent faces it is a 
boundary of (see Figure 17.3). 


S5/ 


Fig. 17.3. Orienting a triangulated surface. 


Denote the polygonal regions by R;. If we sum up f Kk, ds over the boundaries of 
all the regions R;, the result is zero because each edge occurs twice, with opposite 


orientations. Let pu ) 1 <i<vn,, be the interior angles of the jth polygon. Then by 
the Gauss—Bonnet formula for a polygon (Theorem 17.5), 


0=>/ Keds =20F -YS(n-B))->, [ Kvol. 
7 OR; j i=l pO Ri 


In the sum >, dj(% — BY )), the term 7 occurs 2E times because each edge occurs 
twice. The sum of all the interior angles is 27 at each vertex; hence, 


py = 2nV. 
ij 
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Therefore, 


0 =2nF —2nE +>. p\” J Kvol, 
ij M 
or 


| Kvol=2nV —2nE+2nF 
M 


= 27x(M). 


In the Gauss—Bonnet theorem the total curvature [,, K vol on the left-hand side 
is independent of the polygonal decomposition while the Euler characteristic 277 
on the right-hand side is independent of the Riemannian structure. This theorem 
shows that the Euler characteristic of a sufficiently fine polygonal decomposition 
is independent of the decomposition and that the total curvature is a topological 
invariant, independent of the Riemannian structure. 

By the classification theorem for surfaces, a compact orientable surface is classi- 
fied by an integer g called its genus, which is essentially the number of holes in the 
surface (Figure 17.4). 


SJ 


genus 0 genus | genus 2 


Fig. 17.4. Compact orientable surfaces. 


Any two compact orientable surfaces of the same genus are diffeomorphic and 
moreover, if M is a compact orientable surface of genus g, then 


x(M) =2—2¢. 


Theorem 17.10. [f a compact orientable Riemannian manifold of dimension 2 has 
positive Gaussian curvature K everywhere, then it is diffeomorphic to a sphere. 


Proof. By the Gauss—Bonnet theorem, 
2myx(M) =| Kvol > 0. 
M 


So y(M) = 2—2g > 0, which is equivalent to g < 1. Hence, g = 0 and M is diffeo- 
morphic to a sphere. 
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17.7 Gauss—Bonnet Theorem for a Hypersurface in R7"*! 


If M is an even-dimensional smooth, compact, oriented hypersurface in R*”*!, we 
defined its Gaussian curvature K in Section 9.1. The Gauss—Bonnet theorem for a 
hypersurface states that 


_ vol(S?") 
I, Kvo oa). 
For a proof, see [19, Vol. 5, Ch. 13, Th. 26, p. 442]. Forn = 1, vol(S*) = 47 and this 
theorem agrees with Theorem 17.9. 


Problems 


17.1. Interior angles of a polygon 
Use the Gauss—Bonnet formula to find the sum of the interior angles of a geodesic m-gon in 
the Euclidean plane. 


17.2. Total curvature of a plane curve 
Let C be a smooth, simple closed plane curve parametrized counterclockwise by y(s), where 
s € [0, €] is the arc length. If k(s) is the signed curvature of the curve at y(s), prove that 
é 
k(s)ds = 2n. 
0 


This is a restatement of Hopf’s Umlaufsatz, since if @ is the angle of the tangent vector 7’ (s) 
relative to the positive x-axis, then 


) 


f £de 

k(s)ds =[ a” ds=6(2) — 610), 
0 0 ds 
is the change in the angle 6 from s=0 tos = @. 


17.3. The Gauss map and volume 

Let M be a smooth, compact, oriented surface in R?. The Gauss map v: M > S? of M is 
defined in Problem 5.3. If voly and vol: are the volume forms on M and on a respectively, 
prove that 


v* (vols) = K voly, 
where K is the Gaussian curvature on M. 
17.4. Degree of the Gauss map of a surface 


If f: M — N is a map of compact oriented manifolds of the same dimension, the degree of f 
is defined to be the number 


aegf= | fo 


where @ is a top form on N with f,,@ = 1 [3, p. 40]. Prove that for a smooth, compact, 
oriented surface M in R3, the degree of the Gauss map v: M > S? is 


degv = sum), 


(Hint: Take @ to be volg /4a. Then degv = fy V* (vols) /42. Use Problem 17.3 and the 
Gauss—Bonnet theorem.) 
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17.5. Degree of the Gauss map of a hypersurface 

Let M be a smooth, compact, oriented 2n-dimensional hypersurface in R2”+!, Let N be the 
unit outward normal vector field on M. The Gauss map v: M sen assigns to each point 
p <M the unit outward normal Ny. Using the Gauss—Bonnet theorem for a hypersurface, 
prove that the degree of the Gauss map is 


degv = 5am), 


Addendum. Hopf Umlaufsatz in the Plane 


In this Addendum we give a proof of the Hop Umlaufsatz in the plane, more or less 
along the line of Hopf’s 1935 paper [11]. 

Let c: [a,b] + R? be a regular curve (Recall that “regular” means that c’ is never 
zero). Let €: [a,b] + R/27Z be the angle that the unit tangent vector makes with 
respect to the positive x-axis. Because angles are defined up to an integral multiple 
of 27, it is a C® function with values in R/27Z. Since the interval [a,b] is sim- 
ply connected, € has a C® lift €: [a,b] > R, which is unique if we fix the initial 
value €(a) € R. If c is piecewise smooth with vertices at s9 <5) <--+: <s;, then at 
each vertex c(s;), the angle at the endpoint of one segment and the jump angle there 
determines uniquely the angle at the initial point of the next segment. Thus, on a 
piecewise smooth curve there is a well-defined C™ angle function on each segment 
once the initial angle €(a) € R is specified. 


Theorem 17.1 (Hopf Umlaufsatz for a plane curve). Let y: [0,4] > R? be a piece- 
wise smooth simple closed curve oriented counterclockwise and parametrized by arc 
length. Then the change in the angle around y is 


£(0) —6(0) =2n. 


y(t2) 


Fig. 17.5. A simple closed curve. 


Proof. Let C = y((0, 4]) be the image of the parametrized curve. First consider the 
case when C is smooth. Since C is compact, it is bounded and therefore lies above 
some horizontal line. Parallel translate this horizontal line upward until it is tangent 
to C. Translate the coordinate system so that the origin O = (0,0) is a point of 
tangency with a horizontal tangent (Figure 17.5). 
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For any two points y(t;),7(t2) on C with t; < t, let €(t,,t2) be the angle that 


the unit secant vector y(t, ) y(t) /||Y(t1) y(t) || makes relative to the positive x-axis. If 
ty =h, let a (t1,t2) be the angle that the unit tangent vector at y(t; ) makes relative to 
the positive x-axis. 

Let T be the closed triangle in R? with vertices O = (0,0), A = (0,4), and L= 
(¢,£) (Figure 17.6). The angle function €: T > R/27Z is continuous on the closed 
rectangle T and C™ on the open rectangle. Since T is simply connected, 4 lifts to a 
continuous function ¢ : T — R that is C” in the interior of T. 


A(0,£) L(é,£) 


O(0,0) 
Fig. 17.6. Domain of the secant angle function. 


Then dé is a well-defined 1-form on T, and the change in the angle € around the 
curve Y is given by the integral 


pee ee ee) 
By Stokes’ theorem, 


[ab= [aas) =o, 
oT T 
where OT = OL+ LA+AO = OL—AL~— OA. Hence, 


[a= at+ | dt. (17.7) 
OL OA AL 


On the right the first integral /o;d¢ = ¢(0,/) — ¢(0,0) is the change in the angle 
as the secant moves from (0,0) to (0,2), i-e., the initial point of the secant is fixed 
at O while the endpoint moves from O to O counterclokwise along the curve (Fig- 
ure 17.7). Thus, 


[46 = 60,0 -£(0.0)=2-0=n. 
OA 
The second integral on the right in (17.7), 
[45 =600.0-600,9, 
AL 


is the change in the angle of the secant if the initial point of the secant moves from O 
to O counterclockwise along the curve while the endpoint is fixed at O (Figure 17.8). 
Thus, 
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C00 yo) 610.0) 


Fig. 17.7. Change in the secant angle from (0,0) to (0, @). 


Fig. 17.8. Change in the secant angle from (0, £) to (¢,@). 


[4s = £(0,0)—C(0,2) =2n—-n =n. 
So in (17.7), 


c()—co)= f ac= fags [ af =n+n=2n. 


This proves the Hopf Umlaufsatz for a smooth curve y. 

If y: [0,4] > IR? is piecewise smooth so that C has corners, we can “smooth 
a corner” by replacing it by a smooth arc and thereby removing the singularity 
(Figure 17.9). 

In a neighborhood of each corner, the change in the angle is the change in the 
angle over the smooth pieces plus the jump angle at the corner. It is the same as the 
change in the angle of the smoothed curve. Hence, 


Fig. 17.9. Smoothing a corner. 


6(2)—C(0) = VAG+ Dig = 22. 


Chapter 4 


Tools from Algebra and Topology 


This chapter is a digression in algebra and topology. We saw earlier that the tangent 
space construction gives rise to a functor from the category of C® manifolds (M, p) 
with a marked point to the category of vector spaces. Much of differential topology 
and differential geometry consists of trying to see how much of the geometric infor- 
mation is encoded in the linear algebra of tangent spaces. To this end, we need a 
larger arsenal of algebraic techniques than linear spaces and linear maps. 

In Sections 18 and 19, we introduce the tensor product, the dual, the Hom functor, 
and the exterior power. These functors are applied in Section 20 to the fibers of 
vector bundles in order to form new vector bundles. In Section 21, we generalize 
differential forms with values in R to differential forms with values in a vector space 
or even in a vector bundle. The curvature form of a connection on a vector bundle E 
is an example of a 2-form with values in a vector bundle, namely, the endomorphism 
bundle End(£). 

Although we are primarily interested in the tensor product of vector spaces, it is 
not any more difficult to define the tensor product of modules. This could come in 
handy in algebraic topology, where one might want to tensor two abelian groups, or 
in the theory of vector bundles, where the space of C™ sections of a bundle over a 
manifold is a module over the ring of C® functions on the manifold. All the modules 
in this book will be left modules over a commutative ring R with identity. When 
the ring R is a field, the R-modules are vector spaces. The vector spaces may be 
infinite-dimensional, except where explicitly stated otherwise. 


818 The Tensor Product and the Dual Module 


Some of the basic operations on a vector space are not linear. For example, any 
inner product on a vector space V is by definition bilinear, hence not linear on V x V. 
In general, linear maps are easier to deal with than multilinear maps. The tensor 
product is a way of converting multilinear maps to linear maps. The main theorem 
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(Theorem 18.3) states that bilinear maps on a Cartesian product V x W of vector 
spaces V and W correspond in a one-to-one manner to linear maps on the tensor 
product V @W. 


18.1 Construction of the Tensor Product 


Let R be a commutative ring with identity. A subset B of a left R-module V is called 
a basis if every element of V can be written uniquely as a finite linear combination 
Yribj, where r; € R and b; € B. An R-module is said to be free if it has a basis, and 
if the basis is finite with n elements, then the free R-module is said to be of rank n. 


Example 18.1. The Z-module Z/2 has no basis, for the only possible candidate for 
a basis element is 1, yet 0 = 0- 1 = 2-1, which shows that 0 is not uniquely a linear 
combination of 1. 


Let V and W be left R-modules. To define their tensor product, first construct the 
free R-module Free(V x W) whose basis is the set of all ordered pairs (v,w) € V x W. 
This means an element of Free(V x W) is uniquely a finite linear combination of 
elements of V x W: 


> ri(vi,wi), r,ER, (vi, wi) EVxW. 


In Free(V x W), consider the R-submodule S spanned by all elements of the form: 


(vi +v2,w) — (v1,w) — (v2,), 
(v,w1 + w2) — (v,w1) — (¥,w2), 
(rv,w) —r(v,w), vee 
) 


(vy, rw) —r(v,w 
for all v},v2,v € V, w1,w2,w € W, andre R. 


Definition 18.2. The tensor product V ©®zW of two R-modules V and W is the 
quotient R-module Free(V x W) /S, where S is the R-submodule spanned by elements 
of the form (18.1). When it is understood that the coefficient ring is R, we write 
simply V @W. 


We denote the equivalence class of (v,w) by v@ w and call it the tensor product 
of v and w. By construction, v ® w is bilinear in its arguments: 


(V1 +v2)@w=v1 @w+ww, (18.2) 
V@ (wi +w2) =Vv@w, +V@wo, (18.3) 
(rv) @w =r(v@w) =v@rw. (18.4) 


In other words, the canonical map 


®:Vxw-Vew 
(yw) v@w 


is bilinear. 
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Elements of the form v @ w are said to be decomposable in V ® W. By construc- 
tion every element in V © W is a finite sum of decomposable elements: 


¥ ri(vi® wi) = ¥(rivi) @wi- 


The decomposition into a finite sum of decomposable elements is not unique; for 
example, 
V1 Qw+v2@w = (vj + v2) @w. 


18.2 Universal Mapping Property for Bilinear Maps 


The tensor product satisfies the following universal mapping property. 


Theorem 18.3. Let V,W,Z be left modules over a commutative ring R with identity. 
Given any R-bilinear map f: V x W — Z, there is a unique R-linear map f: V ® 
W — Z such that the diagram 


VOW 
tN 
es Mera 


(18.5) 
commutes. 
The commutativity of the diagram means that f o ® = f, or 
f(v@w) =f(v,w) forall (v.w) eV x W. 


Proof (of Theorem 18.3). In general there is a one-to-one correspondence between 
linear maps on a free modules and set maps on a basis of the free module. We define 


F: Free(V xW) > Z 


by setting 
F(v,w) = f(v,w) 


and extending the definition by linearity: 


F(Drivi.wi)) = >Yinfwi). 


Because f is bilinear, F vanishes on all the generators (18.1) of the subspace S. For 
example, 


F ((v1 +v2,w) —(v1,w) — (v2,w)) = F(vi +2, w) — F(v1,w) — F(v2,w) 
= f(v1 +v2,w) — f(vi,w) — f(v2,W) 
=0. 
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So the linear map F induces a linear map of the quotient module 


_ _ Free(V x W) 


f:Vaw 5 eZ 


such that 
f(v@w) = F(v,w) = f(v,w). 


This proves the existence of the linear map f. 
To prove uniqueness, suppose ¢: V ® W —> Z is another linear map that makes 
the diagram (18.5) commutative. Then 


8(v@w) = f(v,w) = f(vew) 


for all (v,w) € V x W, so g and f agree on decomposable elements. Since the decom- 
posable elements span V © W and both g and f are linear, g = f onallofV@W. 


18.3 Characterization of the Tensor Product 


Let V and W be left modules over a commutative ring R with identity. We say that a 
left R-module T and an R-bilinear map ¢: V x W — T have the universal mapping 
property for bilinear maps on V x W if given any left R-module Z and any R-bilinear 
map f: V x W —+ Z there is a unique R-linear map f: T — Z making the diagram 


Ee 


a 


NE re 


Sk 


commutative. The universal mapping property characterizes the tensor product in 
the following sense. 


Proposition 18.4. Suppose V, W, and T are left R-modules and ¢: V x W — T an 
R-bilinear map satisfying the universal mapping property for bilinear maps. Then T 
is isomorphic to the tensor product V ® W via the linear map ®: T > V QW. 


Proof. By the universal mapping property of 7, there is a unique linear map ®: T > 
V @W such that the diagram 
® 
o : 
o 


commutes. Algebraically, 


for all (vw) € V x W. 
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Similarly, by the universal mapping property of V ® W, there is a map @: V ® 
W > T such that 6(v@w) = 6(v,w). Thus, 


(@ ° &)(0(v,w)) = 6(v@w) = o(v,w), 


so @ 0®: T -+T isa linear map that makes the diagram 


T 
‘| oo® 
ore d 


commute. 
On the other hand, the identity map 17: T — T also makes the diagram com- 
mute. By the uniqueness statement of the universal mapping property, 


Similarly, 
&o¢ =lvaw. 


Therefore, ®: T + V @W isa linear isomorphism of R-modules. 


As an example of the universal mapping property, we prove the following propo- 
sition. 


Proposition 18.5. Let V and W be left R-modules. There is a unique R-linear iso- 
morphism 


f:V@W-Wev 
such that 

f(v@w)=w@v 
forallv€V andwe W. 
Proof. Define 

f:Vxw-Wev 


by 
f(v,w) =w®v. 


Since f is bilinear, by the universal mapping property, there is a unique linear map 
f: VQW > WV such that 


(Fo @)(v,w) =wey, 


or 
f(v@w) =we». 


156 §18 The Tensor Product and the Dual Module 


Similarly, one can construct a unique linear map g: W@®V — V @W such that 
Bw@v) =vew. 


Then f and @ are inverses to each other on decomposable elements v @ w. Since 
V QW is generated by decomposable elements, go f = lyaw and fo Z= lway. 


Remark 18.6. In general, one cannot define a linear map on V @ W by simply defin- 
ing it on the decomposable elements v ® w and then extending it by linearity. This is 
because an element of the tensor product V @ W can have many different representa- 
tions as a sum of decomposable elements; for example, 


2Qw=ve2w=ve@wt+vew. 


There is no assurance that the definition of a map on the decomposable elements will 
be consistent. One must always start with a bilinear map f on the Cartesian product 
V x W. Then the universal mapping property guarantees the existence of a unique 
linear map f on V @W with the property that f(v@w) = f(v,w). 


18.4 A Basis for the Tensor Product 
In this section we assume V and W to be free R-modules of finite rank with bases 
V1,--+,V¥,_ and w1,...,Wm, respectively. 
Lemma 18.7. For 1 <k<nand1<é<m, there exist linear functions fs V® 
W > R such that 
ke J 

f (Vi@w;) = 5 i) 
foralll <i<nand\1<j<m 
Proof. Define h’: V x W — R by setting 


key) — slke) 
h (vi,wj) = 67.5) 


on the basis elements v; of V and w; of W and extending to V x W by bilinearity. 
As defined, h'’: V x W > R is bilinear, and so by the universal mapping property it 
corresponds to a unique linear map 


hi’: V@WsR 
such that 


; ke 
HE (vj w3) = hE (vi,wy) = 8G - 


Sete, 
Theorem 18.8. [fv1,...,V, is a basis for V and w,,...,Wm is a basis for W, then 


{vi@wj|1<i<n, 1<j<m} 


is a basis for V @W. 
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Proof. Every decomposable element v ® w can be written as a linear combination of 
Vv; ®w;, for if v = Y.a'v; and w = > b/w,, then 


vQw= Yi a'b/v; @ wj. 


Since every element of V © W is a sum of decomposable elements, we see that the 
set {vj ®w,}i; spans V @W. 

It remains to show that the set {v;® w;}i,; is linearly independent. Suppose there 
is a linear relation 


Yicivi@w; = 0, cl ER. 
ij 


Applying the linear function f* of Lemma 18.7 gives 
0= fH (Ycivi@w;) - Yc f(y, @w;) 
i,j i,j 
_ ij s(k.e) 
= de 5.1) 


= cH, 


This proves that the set {v; ® w;};,; is linearly independent. 


Corollary 18.9. If V and W are free R-modules of finite rank, then 


rk(V @ W) = (rkV) (rk W). 


18.5 The Dual Module 


As before, R is a commutative ring with identity. For two left R-modules V and W, 
define Homr(V,W) to be the set of all R-linear maps f: V + W. Under pointwise 
addition and scalar multiplication: 


(f+8)(v) =f) +80), 
(‘A)(r)=r(FO)), fg € Homa(V,W),veV,r eR, 
the set Homr(V,W) becomes a left R-module. If the ring R is understood from the 


context, we may also write Hom(V,W) instead of Homr(V,W). The dual VY of a 
left R-module V is defined to be Homa(V, R). 


Proposition 18.10. Suppose V is a free R-module of rank n with basis e1,...,€n- 
Define a’: V + R by 
a'(e;) = 6j. 


Then the dual VY is a free R-module of rank n with basis o',... 00". 
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Proof. Problem 18.2. 


Corollary 18.11. Jf V and W are free R-modules of rank n and m, with bases {v;} 
and {wj}, respectively, then the functions 


f@: VOW OR, 
fF (yi@w;)) =50), 1Sk<n, 1<e<m, 


of Lemma 18.7 constitute the dual basis for (V®W)* dual to the basis {v; ® w;}i,j 
for V@w. 


18.6 Identities for the Tensor Product 


This section contains a few important identities involving the tensor product. We 
leave some of the proofs as exercises. 


Proposition 18.12. Let V be a left R-module. Scalar multiplication 


fi RXV OV, 
(rv) rv 


is an R-bilinear map that induces an R-linear isomorphism f : R@RV — V. 


Proof. The bilinear map f: R x V > V induces a linear map f: R®V — V. Define 
g: V > RV by g(v) = 1 ®v. Then g is R-linear and 


gof =Lrey and fog=ly. 


Therefore, f is an isomorphism. 
Example 18.13. For any positive integer m, it follows from the proposition that there 
is a Z-isomorphism 


Z®z(Z/mZ) ~ Z/mZ. 


For a € VY and w € W, denote by a( )w € Homa(V,W) the R-linear map that 
sendv EV to a(v)we W. 


Proposition 18.14. Let V and W be free left R-modules of finite rank. There is a 
unique R-linear isomorphism 


f: VY @W > Homa (V,W) 


such that 
f(a@w) =a )w. 
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Proof. Define f: VY x W + Homr(V,W) by 
f(a,w) = a( )w. 


Since f is R-bilinear, it induces a unique R-linear map f: VY @W > Hom(R,W) 
such that 

f(a@w) =a )w. 
Let {v;}, {w;}, and {a} be bases for V, W, and VY respectively. Then a basis for 
VY @W is {a' @w;} and a basis for Homr(V,W) is {a'( )w;} (Show this). Since 
ff: VY @W — Homa(V,W) takes a basis to a basis, it is an R-linear isomorphism. 


Proposition 18.15. Let V and W be free left R-modules of finite rank. There is a 
unique R-linear isomorphism 


f: VY QW’ = (Veaw)’ 
such that f(&@ B) is the linear map that sends v®w to o(v)B(w). 


Proof. Problem 18.4. 


Example 18.16 (Tensor product of finite cyclic groups). For any two positive integers 
m,n, there is a group isomorphism 


Z/mZ®Z/nZ ~ Z/(m,n)Z, 


where (m,n) is the greatest common divisor of m and n. 


Proof. Define a Z-linear map Z + Z/(m,n)Z by a> a (mod (m,n)). Since mZ 
is in the kernel of this map, there is an induced Z-linear map Z/mZ — Z/(m,n)Z. 
Similarly, there is a Z-linear map Z/nZ — Z/(m,n)Z. We can therefore define a 
map f: Z/mZ x Z/nZ > Z/(m,n)Z by 


f(a,b) =ab (mod (m,n)). 


Clearly, f is bilinear over Z. By the universal mapping property of the tensor product, 
there is a unique Z-linear map f: Z/mZ@Z/nZ — Z/(m,n)Z such that f(a@b) = 
ab (mod (m,n)). 

By the third defining property of the tensor product (18.4), a®b = ab(1@1). 
Thus, the Z-module Z/mZ ® Z/nZ is cyclic with generator 1@1. It remains to 
determine its order. It is well known from number theory that there exist integers x 
and y such that 

(m,n) = mx -+ny. 
Since 


(m,n)(1@ 1) =mx(1@1)+ny(1@1) 
=mx@1+1@ny=0, 


the order of 1 @ 1 is a factor of (m,n). 
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Suppose d is a positive factor of (m,n), but d < (m,n). Then f(d(1@1)) = 
f(d®1)=d #0 in Z/(m,n)Z. Hence, d(1@1) #0. This proves that 1 @ 1 has 
order (m,n) in Z/mZ ® Z/nZ. Thus, Z/mZ ® Z/nZ is cyclic of order (m,n). 


In particular, if m and n are relatively prime, then 


Z/mZ®Z/nZ~ Z/Z =0. 


18.7 Functoriality of the Tensor Product 


In this section we show that the tensor product gives rise to a functor. 


Proposition 18.17. Let f: VV’ and g: W > W' be linear maps of left R-modules. 
Then there is a unique R-linear map 


f®g:VEWV'ewWw’ 

such that (f ® g)(v@w) = f(v) @ g(w). 
Proof. Define 

h:VxWV'@wW' 
by 

h(v,w) = f(v) @g(w). 
This A is clearly an R-bilinear map. By the universal mapping property 
(Theorem 18.3), there is a unique R-linear map 


h: VQEW 3V'ew' 


such that 


h(v@w) =hv,w) = f(v) @g(w). 


The map h is our f @ g. 

Thus, the tensor product construction associates to a pair of left R-modules 
(V,W) their tensor product V @W, and to a pair of R-linear maps (f: V > V',g: W> 
W’) the R-linear map 

fg: VEWV'QW’. 
It is not difficult to check that this construction satisfies the two properties of a 
functor: 
Ga) Ifly: VV and lw: W — W are identity maps, then 
ly @lw: V@W-VeWwW 

is the identity map on V @W. 
(ii) If (f: V > V',g: W > W’) and (f’: V' 3 V",9': W’ = W") are pairs of 

R-linear maps, then 

(f' 88") o(f@g) =(f'° f)@(s'e8). 


So the tensor product is a functor from the category of pairs of R-modules and 
pairs of linear maps to the category of R-modules and linear maps. 
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18.8 Generalization to Multilinear Maps 


The tensor product construction can be generalized to an arbitrary number of factors. 
For example, if U, V, and W are left R-modules, we can construct the tensor product 
U®V®W by following the same procedure as before. First, form the free R-module 
Free(U x V x W) with basis elements all (u,v,w) € U x V x W. Then let S be the 
R-submodule of Free(U x V x W) spanned by all “trilinear relations”: 


(uy +u2,v,W) _ (u1,¥, w) _ (u2,V,W), 
(ru,v,w)—r(u,v,w), and so on. 
The tensor product U @ V ® W is defined to be the quotient module Free(U x V x 
W)/S. The natural map 
@®:UxVxW-U®8VEW 
(u,v,w) HUSvOw 


is trilinear and satisfies the universal mapping property for trilinear maps on 
UxVxwW. 


18.9 Associativity of the Tensor Product 


Proposition 18.18. [fU, V, and W are left R-modules, then there is a unique R-linear 
isomorphism 
US(VEW) +USVEW 


that sends u® (v®w) tou®v@w for alluc U, ve V andweW. 
Proof. It suffices to prove that U @ (V @ W) together with the trilinear map 


¢:UxVxW 5>UB(VOW) 
(u,v,w) > u®(v@w) 


satisfies the universal mapping property for trilinear maps on U x V x W. As in 
Proposition 18.4, any two spaces satisfying the universal mapping property for tri- 
linear maps on U x V x W are isomorphic. 

Let Z be any vector space and f: Ux V x W — Z a trilinear map. Fix u € U. 
Since f (u,v, w) is bilinear in v and w, there is a unique linear map f(u, ): V@W + Z 
such that 


f(u,v@w) = f(u,v,w). 


Thus, there is a commutative diagram 


Ux (V@W) 


a ed Gar aa 
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Now f is bilinear in its two arguments, so there is a unique linear map 
f:U®(V@W) 9UEaVEW 
such that 


f(u®(v@w)) = f(u,v@w) = f(u,v,w). 


Hence, 6: Ux V x W + U®(V @W) satisfies the universal mapping property for 
trilinear maps on U x V x W, and there is a commutative diagram 


UeVEW) 


Taking Z=US@V OW and f=8:UxVxW > USV®W, we get 
f:U®(V@W) 9>UEVEAW 


such that 


f(u® (v@w)) = f(u,v,w) =u@ve@w. 


By the trilinear analogue of Proposition 18.4, f is an isomorphism. 
In the same way we see that there is an isomorphism 
(U@V)@W>U8VEaW 
that takes (u@v) @w tou@v@w. 
Corollary 18.19. If U, V, and W are left R-modules, then there is a unique R-linear 


isomorphism 
U®(V@W) + (U@V)@aW 
such that 
u®(v@w) + (u®@®v)@w 
forallucU,veEV andwe W. 


18.10 The Tensor Algebra 
For a left R-module V, we define 
T°(V)=R, TiV)=V, T?(V)=VeV,..., 


T'(V) =V@---@V, ..., 
k 
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and 


Here the direct sum @ means that each element v in T(V) is uniquely a finite sum 
v= ivy, where vy, € T*(V). Elements of T*(V) are said to be homogeneous of 
degree k. There is a multiplication map on T(V): first define 
erVsrVvjsr yy), 
(x,y) 4x®y, 
and then extend yp to T(V) x T(V) > T(V) by R-bilinearity. In this way, T(V) 


becomes a graded R-algebra, called the tensor algebra of V. By Corollary 18.19, the 
tensor algebra T(V) is associative. 


Problems 


In the following problems, R is a commutative ring with identity. 


18.1. Free R-modules of rank n 
Show that a free R-module of rank n is isomorphic to R@---@R (n copies). 


18.2. The dual of a free module 
Prove Proposition 18.10. 


18.3. Hom of free modules 
Prove that if V and W are free left R-modules of rank n and m, respectively, then Homr(V, W) 
is a free R-module of rank mn. 


18.4. Dual of a tensor product 
Prove Proposition 18.15. 


18.5. Tensor product of finite cyclic groups 
Let m and n be two positive integers. Identify Z/mZ ®@z Z/nZ as a finite abelian group. 
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$19 The Exterior Power 


In this section, R denotes a commutative ring with identity and all R-modules are 
left R-modules. Let V be a left R-module and let V‘ = V x --- x V be the Cartesian 
product of k copies of V. An alternating k-linear map from V to R is a k-linear map 
f: V*« — R that vanishes whenever two of the arguments are equal, i.e., 


fe yore ye) =O. 


Just as the tensor product solves the universal mapping problem for multilinear maps 
over R, so we will now construct a module that solves the universal mapping problem 
for alternating multilinear maps over R. The question is, given an R-module V, 
does there exist an R-module W such that alternating k-linear maps from V* to an 
R-module Z correspond canonically in a one-to-one way to linear maps from W to Z? 
The construction of the exterior power a V answers the question in the affirmative. 


19.1 The Exterior Algebra 


For a left R-module V, let /(V) be the two-sided ideal generated by all elements of 
the form v @ v in the tensor algebra T(V), for v € V. So /(V) includes elements such 
asa®@v@v@b fora,beT(V). 


Definition 19.1. The exterior algebra /\(V) is defined to be the quotient algebra 
T(V)/I(V). We denote the image of vj ®@--- @ vg by 


Vp Ar AVp. 


Such an element is said to be decomposable and the operation / is called the wedge 
product. 


Define \*(V), the kth exterior power of V, to be the image of T*(V) in A(V) 
under the projection T(V) — A(V). Then there is a canonical module isomorphism 
AW) LV). Pav) 
~ TEV)NV) FEV)’ 


where I‘(V) := T*(V) M1(V) consists of the homogeneous elements of degree k in 
I(V). In this way the exterior algebra (\(V) inherits a grading from the tensor algebra 
and itself becomes a graded R-algebra. We often write A‘ V for AX(V). 


19.2 Properties of the Wedge Product 


Let R be a commutative ring with identity and V a left R-module. In this subsection, 
we derive properties of the wedge product on the exterior algebra \(V). 
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Lemma 19.2. Let V be an R-module. In the exterior algebra \(V), 
u\v=—viAu 
forallu,v €V. 


Proof. For any w € V, we have by definition w ® w € I(V), so that in the exterior 
algebra |\(V), the wedge product w/w is 0. Thus, for all u,v € V, 


0=(u+v)A(u+yv) 
=uAut+tuAv+vAu+vAvV 
=u/Av+vAu. 


Hence, uA v= —vAu. 


Proposition 19.3. [fu € \‘V andve€ AV, thenuAve \i*"V and 
uAv=(—-1)vAu. (19.1) 


Proof. Since both sides of (19.1) are linear in u and in v, it suffices to prove the 
equation for decomposable elements. 
So suppose 
uU=ujy/A---Au, and v=vyA---Avge. 


By Lemma 19.2, vj Auj = —uj Avi. InuAv, to move v; across uy A\--: Aug requires 
k adjacent transpositions and introduces a sign of (—1)*. Similarly, moving v2 across 
u; \-++/A ug also introduces a sign of (—1)*. Hence, 


uAv=(—1)kyy Au A+ AugAvoA--Ave 
= (—1)¥(-1)*v, Ava Au A+ Aug AV3 A+ Ave 


(—1)¥y, A---AveAu A: Aug 
= (-1)vnu. 


Lemma 19.4. In a decomposable element v, \ +++ Avg € a V, with each v; € V, a 
transposition of v; and v; introduces a minus sign: 


VA Avi Ar AVI At AVR = HV Ar AVA AVA AVR. 


Proof. Without loss of generality, we may assume i < j. Let a=vj4) A---Avj-1. It 
suffices to prove that 
viAaAvj=—vjAaAyj. 


By Proposition 19.3, 
(vj Aa) Av; = (—1)884t yA (Aa) 
= (i (—1)*84y; A (an vi). 
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Proposition 19.5. Jf is a permutation on k letters and v; € V, then 
Va(1) Av AVa(k) = (sgn7)v1 A+++ Avg. 


Proof. Suppose 7 can be written as a product of @ transpositions. Then sgn(z) = 
(—1)*. Since each transposition of the subscripts introduces a minus sign, 


Va(1) A+ ** A Va(k) = (-1), A-- Aye = (sgn 7) vy A--- Avg. 


19.3 Universal Mapping Property for Alternating k-Linear Maps 


If V is a left R-module, V* is the Cartesian product V x --- x V of k copies of V, and 
Ae V is the kth exterior power of V, then there is a natural map \: Vk AN V5 


A(V1,---5 VK) = Vi Ar AVR. 
This map is clearly k-linear. By Proposition 19.5, it is also alternating. 


Theorem 19.6 (Universal mapping property for alternating k-linear maps). For 
any R-module Z and any alternating k-linear map f : V‘ —> Z over R, there is a 
unique linear map f : A‘ V — Z over R such that the diagram 


Ae V 


commutes. 


The commutativity of the diagram is equivalent to 


Flv A= Avg) = f(v15- +05 Ve) (19.2) 


for all v1,...,v,in V. 


Proof. Since f is k-linear, by the universal mapping property of the tensor product, 
there is a unique linear map h: T*V — Z such that 
A(v1 @++-@ ve) = f(V1,---5 Ve): 


Since f is alternating, 
h(---@v@v®--)=f(...,v,vy...)=0 forallvevV. 


So h vanishes on the submodule /‘(V), and therefore h induces a linear map on the 
quotient module 
i if 


k 
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such that 


F(VIA +++ Avg) = A011 @ +++ @ vg) = f (V1, 5 Ve) 

This proves the existence of /: AV tz, 
To prove its uniqueness, note that (19.2) defines f on all decomposable elements 
in ‘Vv. Since every element of A'V is a sum of decomposable elements and f is 
linear, (19.2) determines uniquely the value of f on all of has 


For a module V over a commutative ring R with identity, denote by L;,(V) the 
R-module of k-linear maps from V* to R, and A;,(V) the R-module of alternating k- 
linear maps from V* to R. By the universal mapping property for the tensor product, 
k-linear maps on V* may be identified with linear maps on the tensor power T'V: 


L(V) = (TRV)”, 
dees 
where 
F(V1,--+5 Ve) = f(v) @-+-@vy). 


Similarly, by the universal mapping property for the exterior power, alternating 
k-linear maps on V may be identified with linear maps on Nv: 


Ay (V) © (A\v) (19.3) 
are 
where 


f(v1,---,¥E) = Fv Av Avy). 


19.4 A Basis for \‘V 


Let R be a commutative ring with identity. We prove in this section that if V is a 
free R-module of finite rank, then for any k the exterior power AV is also a free 
R-module of finite rank. Moreover, from a basis for V, we construct a basis for N V. 


Lemma 19.7. [f B = {e1,...,€n} is a basis for the free R-module V, then 
ep A+++ Ae, £0. 
It follows that for 1 <iy <-++ <ig <n, we have e;, \--- Ne, £0. 


Proof. It will be enough to define a linear map /\"V — V that takes e;) \--- Ae, toa 
nonzero element. First define an alternating set map f: B” > R by 


flex), cet :€n(n)) = sgn(z), 


where 7 is a permutation of {1,...,n}. Next extend f to ann-linear map f: V" > R, 
which is also alternating because it is alternating on basis elements. Indeed, 
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r(. - De asia) Papen) 
= b) flied irnyejyecs) (by n-linearity) 
SY A tye Ghai) (by definition of f) 
Sf ae Deca eee) (by n-linearity) 


By the universal mapping property for alternating m-linear maps, there is a unique 
linear map 


Ve A\'V>R 


such that the diagram 


commutes. In particular, 
Fler A:+:Aen) = f(e1,..-5€n) =]. 


This shows that e; \---Ae, £0. 


Theorem 19.8. If {¢1,...,@n} is a basis for V, then 
Si= {e, A---Ag, | li <-<& <n} 
is a basis for \KV. 
Proof. Since the exterior power A‘V isa quotient of the tensor power T*V and 
{e;, @-+-@e, | 1<is,...,i% <n} 


is a basis for T*V, the set S spans A‘v. 
It remains to show that the set S is linearly independent. We introduce the multi- 
index notation J = (1 <i) <-+-<ig <n) and 


er = i, N+ Nei. 
Suppose there is a linear relation 
Yid'e; =0, (19.4) 


where J runs over all multi-indices 1 <i) <--- <i, <noflengthk. If J = (jj <-:-< 
Jk) is one particular multi-index in this sum, let J’ be its complement, also arranged 
in increasing order. By definition, ey and ey have no factor in common, but if J and 
J both have length k and J 4 J, then there is an e; in e; that is not in e;, so that e; and 
ey will have e; in common. Hence, 
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es\ep = xe, A:::Aen, 
ey \ep =O, if AJ. 


Taking the wedge product of (19.4) with ey gives 


O= (Ya'e7) Aer= ale; Ney = tale, A+++ Aen. 


Since e; A---Aé, 40 (Lemma 19.7), a’ =0. So the set S is linearly independent 
and hence is a basis for the exterior power az 


Corollary 19.9. If V is a free R-module of rank n, then NV is a free R-module of 
rank (')). 


19.5 Nondegenerate Pairings 


Let R be a commutative ring with identity. A pairing of two R-modules V and W is 
a bilinear map 
(,):Vxwor. 


The pairing is said to be nondegenerate if 
(yw) =Oforalwew = v=0, 


and 
(yw) =OforallveV => w=0. 


Example 19.10. An inner product on a real vector space V is a nondegenerate pairing 
of V with itself, since if (v,w) = 0 for all w € V, then (v,v) = 0 and hence v = 0. 
Similarly, (v,w) = 0 for all v € V implies that (w,w) = 0 and hence w = 0. 


If there is a pairing ( , ) between two left R-modules V and W, then each v in V 
defines a linear map 


(vy, ): WR. 


So the pairing induces a map V > WY, given by 
ves (vy, ). 


This map is clearly R-linear. Similarly, the pairing also induces an R-linear map 
W > VY viaw+ ( ,w). The definition of nondegeneracy says precisely that the 
two induced linear maps V — WY and W > VY are injective. For finite-dimensional 
vector spaces this is enough to imply isomorphism. 


Lemma 19.11. Let V and W be finite-dimensional vector spaces over a field R. If 
(, ): Vx W = Ris a nondegenerate pairing, then the induced maps V + WY and 
W > VY are isomorphisms. 
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Proof. By the injectivity of the induced linear maps V > WY and W > VY, 
dimV <dimWY and dimW <dimV”. 
Since a finite-dimensional vector space and its dual have the same dimension, 
dimV < dimWY = dimW < dimVY = dimV. 


Hence, dimV = dimW. So the injections V > WY and W > VY are both isomor- 
phisms. 


Any finite-dimensional vector space V is isomorphic to its dual VY, because they 
have the same dimension, but in general there is no canonical isomorphism between 
V and VY. If V is finite-dimensional and has a nondegenerate pairing, however, then 
the pairing induces a canonical isomorphism 


VoV’, 
vis (y, ). 


19.6 A Nondegenerate Pairing of \*(V”) with \‘V 


Let V be a vector space. In this subsection we establish a canonical isomorphism 
between /\*(VY) and Ce v)’ by finding a nondegenerate pairing of of A\*(VV) 
with \‘V. 


Proposition 19.12. Let V be a module over a commutative ring R with identity. The 
multilinear map 


VOR VES R, 
((B',---sB*), (v15---5¥%)) > detlB(v,)] 
induces a pairing 
Aw’) x AV aR. 
Proof. For a fixed (B',...,B*) € (VY), the function: VF > R 
(v1,..-,¥%) + det[B'(v;)] 


is alternating and k-linear. By the universal mapping property for A‘V, there is a 
unique linear map A‘V — R such that 


vi A+++ Avg det[Bi(v;)]. 


In this way we have constructed a map 


h: (VY) (A‘v) 
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such that 
n(p!,..., BY (1 A+++ Avg) = det[B'(v,)]. 


Since h(B',...,B*) is alternating k-linear in B',...,B*, by the universal mapping 
property again, there is a linear map 


‘5 k k Vv 
h: MY) > (AV) 
such that 
h(B' A+ AB*)(vi Av Avy) = det[B'(v,)]. 
The map / gives rise to a bilinear map 
k k 
K\ VY)x\ VR 


such that 


(BL A---A BE vi A+++ Avg) + det[B‘(v;)]. 


Theorem 19.13. [fV is a free R-module of finite rank n, then for any positive integer 
k <n, the linear map 


% Vv 
i: NW) > (A'V) 
(BL A+ AB) (V1 A-++ Avg det[B!(v,)]), (19.5) 
from the proof of Proposition 19.12 is an isomorphism. 


Proof. To simplify the notation, we will rename as f the map h. Since duality leaves 
the rank of a free module unchanged, by Corollary 19.9 both A*(VY) and (A*‘V)Y 
are free R-modules of rank ean We will prove that f maps a basis for Ke (VY) toa 
basis for (\*V)Y, and is therefore an isomorphism. 
To this end, let ¢1,...,@, be a basis for V, and @!,...,@" its dual basis for VY. 
Then 
{er := ei, A\-+-Ae, | LS i <--- <i <n} 


is a basis for Ng V, and 
fol = an. Aakt|1<i <- <i <n} 
is a basis for \K(VY). Denote by e the basis for (A v) ‘ dual to {e7}, defined by 
e7 (es) = Ory. 
We claim that f(a!) = e7. First consider f(a!) (e,;) for 1 =J: 


f(a") (er) = f(a A---Aat®)(e, A+ Ae) 
= det[a"(e;,)] = det[5’] = 1. 
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Next, if 7 A J, then some i,, is different from j),..., jz. Then 
f(a")(e7) = det[or'"(e;,)] = 0, 


because the ro-th row is identically zero. Thus, f(a’) = e7 for all multi-indices /. 
Since f maps a basis to a basis, it is an isomorphism of free R-modules. 


Theorem 19.13 and (19.3) together give a sequence of module isomorphisms 
k k \V 
NWw')= (AV) 2 Adv), (19.6) 


19.7 A Formula for the Wedge Product 
In this section, V is a free module of finite rank over a commutative ring R with 


identity. According to (19.6), an element B! \ --- A B* in \‘(VY) can be interpreted 
as a k-linear map V x --- x V + R. Thus, if @ € A‘(VY) and B € \“(VY), then 


ABE Ai" (VY) ~Anse(V) 


can be interpreted as a (k + @)-linear map on V. We will now identify this (k + ¢)- 
linear map a A B: V'+" — R. 


Definition 19.14. A (k,2)-shuffle is a permutation 7 € S,.¢ such that 


m(1)<+:-<am(k) and aw(k+1)<---<a(k+2). 


Proposition 19.15. For a € \‘(VY), BE A‘(VY), and vj, € V, 
(OA B)(V1,..-5 Vere) = ¥ sgn() Oe (Vays +s Vay) B (Vaceet)> «<9 Veeee)) » 
1 


where 1 runs over all the (k, £)-shuffles. 


Proof. Since both sides are linear in @ and in B, we may assume that both o and B 
are decomposable, say 


a=ala---Aadk, B=Bla--- AB. 


Given any permutation 


s= 1s k k+l +s k+é cs 
p(1) ++ p(k) p(k+1) ++ p(k+Q] ~ 7 
we can turn it into a (k,@)-shuffle as follows. Arrange p(1),...,9(k) in increasing 


order 


pli) <-++ < plik) 
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and define o(j) = i; for 1 <j <k. Similarly, arrange p(k + 1),...,p(k+@) in 
increasing order 
Plikt1) <+++ < Plic+e) 
and define t(j) =i; fork+1< j<k+2. Then p oo Tisa (k, €)-shuffle. 
Thus, we can get all permutations of 1,...,4-+2@ by taking all (k, @)-shuffles and 
permuting separately the first k elements and the last ? elements. Let S¢ denote the 
group of permutations of k+1,...,k+@. Then using (19.5), 


(or Asse OF AB A=W BV Giie sos ieee) 
= © sgn(p)o! (vp) +++ oe (vp(ay) B (vp erty) «BY (Mpeg) 


PESK +e 


= Y= 2» YX sgn(x)sgn(o)sgn(c) 


(k,€)-shuffles 7 OCS, TES? 
Gt! (Ym(o(a))) °° & (Ym(o(h))) BY (Ymce(e+t))) “+ BY (acer) (9-7) 
(The sign comes from the fact that p = 170 T0<.) 
Now set wj = vz). Then wo(j) = Va(o(j))> $0 
X sen(o)or! (vaio(1))) +++ &* (vx(o(e)) 
OES; 


= > sen(o)o! (worry) +++ &* (Woy) 


OES, 
(al A---Aat*)(w,..., We) 


= (a'A---Aat*) (v_ays---s¥ae))- 


Thus, the sum (19.7) is 


by sgn()(al A---A ak) (va(1)s<-+5¥a(K)) 
(k,£)-shuffles 7 


(BE A--- AB) (vate) Yale+e)) 


= = sgn(7) a (vac1); + Yak) B (va(et1)> . Van(k+0)) . 
(k,¢)-shuffles 7 


Problems 


In the following problems, let V be a left module over a commutative ring R with identity. 
19.1. Symmetric power of an R-module 
A k-linear map f: V‘ — R is symmetric if 
f(vma): cae »Yn(k)) = f(v1,- hs VE) 
for all permutations z € S; and all vj € V. 


(a) Mimicking the definition of the exterior power ((V), define the symmetric power S*(V). 
(b) State and prove a universal mapping property for symmetric k-linear maps over R. 
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$20 Operations on Vector Bundles 


On a compact oriented Riemannian manifold M of dimension 2, let Q be the cur- 
vature matrix of the Riemannian connection relative to an orthonormal frame e, e2 
on an open subset of M. In the proof of the Gauss—Bonnet theorem, we saw that 
the 2-form K vol on M is locally the entry Q} of the curvature matrix. The integral 
Jy K vol turns out to be a topological invariant. There are many ways to general- 
ize the Gauss—Bonnet theorem; for example, we can (i) replace the 2-manifold M 
by a higher-dimensional manifold and replace K vol by some other differential form 
constructed from the curvature form of M, or (ii) replace the tangent bundle by an 
arbitrary vector bundle. To carry out these generalizations we will need some pre- 
liminaries on vector bundles. 

In this section, we define subbundles, quotient bundles, and pullback bundles. 
Starting from two C™ vector bundles FE and F over the same base manifold M, we 
construct their direct sum E © F over M; it is a vector bundle whose fiber at a point 
x € M is the direct sum of the fibers E;. and F,. This construction can be generalized 
to any smooth functor of vector spaces, so that one can obtain similarly C” vector 
bundles E ® F, Hom(E, F), EY, and ME over M (see Section 20.7). 


20.1 Vector Subbundles 


Definition 20.1. A C® subbundle of a C® vector bundle 7: E — M is aC™ vector 
bundle p: F — M such that 


(i) F is aregular submanifold of FE, and 
(ii) the inclusion map i: F — E is a bundle homomorphism. 


By composing a section of the subbundle F with the inclusion map i: F —> E, 
we may view a Section of F as a section of the ambient vector bundle E. 


Definition 20.2. A k-frame of a C™ vector bundle 7: E — M over an open set U 
in a manifold M is a collection of sections 51,...,s, of E over U such that at every 
point p in U, the vectors s|(p),...,5,(p) are linearly independent in the fiber Ey. In 
this terminology, a frame for a vector bundle of rank r is an r-frame. 


Lemma 20.3. Let 7: E — M be a C™ vector bundle of rank r and let p € M. Fix a 
positive integer k <r. AC™ k-frame s1,...,5, for E defined on a neighborhood U of 
p can be extended to a C™ r-frame for E on a possibly smaller neighborhood W of p. 


Proof. Replacing U by a smaller neighborhood of p if necessary, we may assume 
that there is a frame e),...,e, for E over U. On U each 5; is a linear combination 
$j = Laie with a, € C°(U). In matrix notation 
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The r x k matrix a= [a’] has rank k at every point in U. By renumbering e),...,e, 
we may assume that the top k x k block a’ of a is nonsingular at the point p. Since 
nonsingularity is an open condition there is a neighborhood W of p on which the 
k x k block a’ is nonsingular pointwise. 


/ 
F a : : 
We can extend the r x k matrix ad = | to a nonsingular r x r matrix b by 


adjoining an r x (r—k) matrix | ; 


/ 
a 0 
* I 
Then 
[S1 +++ Sk Ckga ++ er] = [er +++ er]d. 
Since b is an n Xn nonsingular matrix at every point of W, the sections s1,...,5,, 
€xk+1,---, er form a frame for E over W. 


20.2 Subbundle Criterion 


A vector bundle is a locally trivial family of vector spaces over a base space. The 
following theorem gives a sufficient condition for a family of vector subspaces over 
the same base space to be locally trivial and therefore to be a subbundle. 


Theorem 20.4. Let 7: E — M be aC vector bundle of rank r and F := || peu Fp a 
subset of E such that for every p in M, the set F, is a k-dimensional vector subspace 
of the fiber E,. If for every p in M, there exist a neighborhood U of p and m > k 
smooth sections S\,...,8m of E over U that span Fg at every point q € U, then F isa 
C™ subbundle of E. 


Proof. By making a neighborhood W of p in U sufficiently small, one may ensure 
the triviality of E over W, so that there exists a local frame e;,...,e, for E over W. 
On W, each section s; is a linear combination s; = ¥j_ ae; for an r x m matrix 


a= [a'' of C® functions. In matrix notation 


At every point of W the matrix a has rank k, because the columns of a are simply 
the column vectors of 51,...,5,, relative to the basis e;,...,¢@, and s1,...,5, span a 
k-dimensional vector space pointwise. In particular, at the point p the matrix a(p) = 
[a‘( p)| has rank k. It follows that a(p) has a nonsingular k x k submatrix a’(p). By 
renumbering s1,...,5 and e),...,e-, we may assume that a’ (p) is the left uppermost 
k x k submatrix of a(p). Since the nonsingularity of a matrix is an open condition, 
a’ is nonsingular on a neighborhood W’ of p in W. At every point of W’, since a’ has 
rank k, so does the matrix [1 --- s,]. This proves that s1,...,s,; form a frame for F 
over W’. 
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By Lemma 20.3, s1,...,5, can be extended to a C™ frame f),...,t;,...,¢, for E, 
with ¢; = s; for i= 1,...,k, over a possibly smaller neighborhood W” of p in W’. 
For any v € Elwy, let v = Y7_, c!(v)t;(x(v)). Then Fly is defined by c+! =0,..., 


c’ =00n Elwr. This proves that F is a regular submanifold of E, because the C* 
trivialization 


own : Ely s w" x R’, 


ves (a(v),c!(v),-..,€7(v)), 
induces a bijection Wyn: Fly + W" x R*. (The c’’s are C* by Problem 7.5.) As 
the restriction of aC” map, wr is C™. Its inverse, 
1 1 k A a 
Wyn: (p,c preeg€ ) nd cap) _ Y c'si(p), 
i=l i=1 

is C*. Hence, F is a C® vector bundle of rank k. The inclusion F <-> E, locally 
given by 


w"xRk5Ww’ xR’, 
(p,cl,...,c*) ++ (p,cl,...,c*,0,...,0) 


is clearly a bundle map. Thus, F is a smooth rank-k subbundle of E. 


20.3 Quotient Bundles 


Suppose F' is a C™ subbundle of a C™ vector bundle 7: E — M. At each point p in 
M, the fiber F,, is a vector subspace of E,, and so the quotient space Q, := Ep / F, is 


defined. Let 
Q:= L] Qp = | | G>/*). 


pEM pEM 


and give Q the quotient topology as a quotient space of E. Let p: E + Q be the 
quotient map. The projection 7: E — M then induces a map 71g: Q — M as in the 
commutative diagram 


Foc g's 9 


‘| Ao 


M. 


Since F is locally trivial, every point p in M has a coordinate neighborhood 
over which one can find a C™ frame s,...,5, for F. By Lemma 20.3, s1,...,5,% 
can be extended to a C™ frame s1,...,5,,5,41,---,5- for E over a possibly smaller 
neighborhood W of p. A point v of E|w is uniquely a linear combination 
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v= Yic'(v)s; (a(v)). 


The functions c! on Ely are coordinate functions and hence are C”. 

Let 5441,-..,5-: W — Q be the sections s;41,...,5, followed by the projection 
E > Q. Then (5,41,...,5,-) is a continuous frame for Q over W, and every element 
FE Olw:= To '(W) can be written uniquely as a linear combination 


p= Ya'(¥)5;(m9(9)). 
This gives rise to a bijection 


ow: Olw >WxR™, 


a (p.2°*1(0),...,0°(0)) | p=No(¥). 


The maps ¢ are continuous on Q|w because their lifts é’ o p =c' to E|w are contin- 
uous. Thus, @yw is a continuous map. Since 


by ee one = Yeap) 


is clearly continuous, dy is ahomeomorphism. Using the homeomorphisms @¢y, one 
can give Q a manifold structure as well as a C™ vector bundle structure over M. With 
this vector bundle structure, 79: Q — M is called the quotient bundle of E by F. 


20.4 The Pullback Bundle 


If 7: E — M isaC™ vector bundle over a manifold M and f: N — M is aC map, 
then there is aC™ vector bundle f*E over N, called the pullback of E by f, with the 
property that every bundle map covering f factors through the pullback bundle f*E 
(see Proposition 20.8). 

The total space of the pullback bundle of E by f is defined to be the set 


PE={(n,e)ENXE| f(n)=x(e)}, 
endowed with the subspace topology. The projections to the two factors, 


7: ffE ON, C: ffE SE, 
n(n,e) =n, C(n,e) =e, 


fit into a commutative diagram 


n| | (20.1) 


We will show that 7: f*E — N is a vector bundle. First, we show that the pullback 
of a product bundle is a product bundle. 
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Proposition 20.5. [f f: N — M is a C® map of manifolds and m: E=MxV—M 
is a product bundle, then the projection n: f*E — N is isomorphic to the product 
bundle Nx V + N. 


Proof. As a set, 
frE= {(n, (m, v)) ENx(MxV) | f(n) = x(m,v) =m} 
= {(n,(f(n),v)) EN x (MxV)}. 
The map 
o: ffESNXxYV, 


(n, (f(n),v)) - (n,y) 


with inverse (n,v) ++ (n,(f(n),v)) is a fiber-preserving homeomorphism. It gives 
1: f*E — N the structure of a C™ vector bundle over N. 


Theorem 20.6. Let 7: E — M be a C™ vector bundle with fiber V and f: N+ M 
a C® map. The projection n: f*E — N can be given the structure of a C® vector 
bundle with fiber V. 


For any map 7: E — M and open set U C M, recall that the restriction of E to 
U is denoted Ely := 27 !(U) 4 U. 


Proof. Since E is locally a product U x V — U, by Proposition 20.5, the pullback 
f*E is locally the product f~'!(U) x V > f7!(U). 


Lemma. Suppose ™: E — M is aC™ vector bundle with fiber V and f: N— M isa 
C™ map. Let U be an open subset of M. Then 


(PE) pay) =f (Elv)- 


Proof. By definition, 
(PE) py) ={(n,.e) ENXE|ne f'(U),f(n) =2(e)} 


{(n,e) € f-'(U) XE | f(n) = n(e)}. 
f (Elu) = {(ne) € f-'(U) x Elu | f(a) = a(e)}. 


Comparing the two, we see that they are equal. 


Proposition 20.7. Suppose 1: E — M is a C™ vector bundle with fiber V and trivi- 
alizing open cover {Ug} and f: N— M is aC® map. If gap: UnUg — GL(V) is 
the transition function for E over Uj Up, then f* gag is the transition function for 
f*E over f-'(UaN Up). 
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Proof. Suppose ¢q: E|y,, ~ Uo, x V is the trivialization for E over Ug, with 


bale) = (x(e),a(e)). 
Then 
f (Elva) = (FE) pauy 7 f (U) XV 
is given by 
(ne) (n, Oa(e)). 
So the transition function for f*E over f~! (Ug) f7! (Ug) is 


(Gao dg ')(F(n)) = (F*8ap)(n)- 


Proposition 20.8. Suppose 1p: F — N and mtg: E > M are vector bundles and 
@: F — Eis a bundle map that covers f: N — M, i.e., the diagram 


E 
|» (20.2) 


commutes. Then there is a unique bundle map @: F — f*E over N that makes the 
following diagram commute: 


(20.3) 


Proof. For all qg € F, the commutativity of the diagram (20.3) forces 6(q) = 
(ar(q),9(q)). This shows that @ is unique if it exists. Because — covers f, f(ar(q)) 
= 1 ((q)). Hence, (zr (q),9(q)) € f*E. So the map @ as defined above indeed 
exists. It is R-linear on each fiber because @ is. It is continuous because 7 and ~ 
are assumed continuous. 


This proposition shows that given f and E, the commutative diagram (20.1) of 
the pullback bundle f*E is a final object among all commutative diagrams of the 
form (20.2). 
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20.5 Examples of the Pullback Bundle 


The pullback construction for vector bundles is indispensable in differential geom- 
etry. In this section we give two examples. The first uses the pullback bundle to 
convert a bundle map over two different base manifolds to a bundle map over a sin- 
gle manifold. This is sometimes desirable in order to obtain an exact sequence of 
vector bundles over a manifold (see (27.5)). The second example uses the pullback 
bundle to clarify the notion of a vector field along a curve in a manifold, which we 
encountered in Sections 4.3 and 13.1. 


Example 20.9 (The differential of a map). If f: N — M is aC™ map of manifolds, its 
differentials fp: TpyN — Ty(p)M at all points p € N piece together to give a bundle 
map f.: TN + TM of tangent bundles. By Proposition 20.8, the bundle map f. 
induces a unique bundle map f.: TN — f*TM over N that makes the diagram 


commutative. The map ie : TN > f*TM is given by 
Xp © TyN +> (p, fe pXp € Ty(p)M). 


Conversely, f,. can be obtained from ‘2 as fx =O = In this way the bundle map /f,. 
over two base manifolds is converted to a bundle map /, over the single manifold N. 


Example 20.10 (Vector fields along a curve). If c: I—> M is a smooth map from an 
open interval J C R into a manifold M, then the pullback c*TM of the tangent bundle 
TM is a vector bundle over J. A section of the pullback bundle c*TM assigns to each 
t € J an element of the fiber (c*TM),; ~ T.)M, i.e., a tangent vector to M at c(t). In 
other words, a section of c*TM is precisely a vector field along the curve c(t) in M 
defined in Section 4.3. 


Exercise 20.11. Show that a vector field along a curve c in M is smooth if and only if the 
corresponding section of c* TM is smooth. 


Thus, we can identify the space of smooth vector fields along c(t) with the space 
of smooth sections of the pullback bundle c* TM: 


T(TM| eq) =F(c*TM). 
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20.6 The Direct Sum of Vector Bundles 


Suppose 7: E + M and zm’: E' > M are two smooth vector bundles of ranks r and 
r’, respectively, over a manifold M. For p € M, denote by E, the fiber x! (p) of E 
over p, and by Ey, the fiber (z’)~'(p) of E’ over p. Define the direct sum E @ E' as 
a Set to be the disjoint union 


E@E':= | | EE, := U {p} x (Ep@ Fh). (20.4) 
pEM pEM 


Let p: E®E’ — M be the projection p(p, (e,e’)) = p. We will now need to put a 
topology and a manifold structure on E $ E’ so that p: EG E’ > M becomes a C” 
vector bundle with fibers E, @ E = 


Lemma 20.12. If {Ua }aca and {Vg} pep are open covers of a topological space M, 
then {Ug Vp }(@,B)EAxB is again an open cover of M. 


Proof. Since {Vp} is an open cover of M, for each a € A, 


Ug =UaOM = Ua (Vp) =U(UanVp). 
B 


Thus 


M =(JUe = LU Ua Ve). 


a ap 


If {Ua}aea and {Vp} gcp are trivializing open covers for E and E’, respectively, 
then by Lemma 20.12 {Ug,M V5 }aea,pep is an open cover of M that simultaneously 
trivializes both E and E’. 

Choose a coordinate open cover UL = {Ug,}aea of M that simultaneously trivial- 
izes both E and E’, with trivializations 


Va: Ely, UR" and wy: E'\y, $UaxR". 


By adding all finite intersections Ug, M--- Ug, to the open cover L, we may assume 
that if Ua,Ug € Lt, then Ug Ug € L. For each p € Ug, there are linear isomorphisms 


Vo,p: Ep + {p} xR" and wo: E, > {p}x R”. 
By the functorial property of the direct sum, there is an induced linear isomorphism 


~N J 
ba,p = Yap P Wop: E,@E, + {p}x RT, 


and hence a bijection 


ga: | | 2p @E,=p7'Ua) $UaxR 
peUa 


that is a linear isomorphism on each fiber. 
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To simplify the notation we will sometimes write Ugg = Ug Ug. For a, BEA, 
the transition function 


ba ds: : Uap Xx Rt” _, p | (Uap) + Uap x Rt 
restricts on each fiber to 


bap ° 0), {p} xR" +E, OE, > {p} xR”, — pe Ugg. 


If we set go8(P) = Wa,p © Wg, and 84,5 (P) = Wap ° (We ,) |. then 


/ 


(G2° 5 ')(P.(¥.)) = (Ps (Warr © Wp)” (Warp © (¥p,p) YY’) 
= (p, (Sop (PY. Sup (r)’)) 
This shows that dq o op! is a continuous map, because ggg: Ugg — GL(r,R) and 
Six B° Uap GL/(r’,R) are both continuous. As its inverse 08 o Pq ' is also continu- 


ous, dg o %g lisa homeomorphism. 


Since Ug x R'*” ‘has a topology, we can use the trivialization dq to define a 
topology on p~!(Ug) so that dg becomes a homeomorphism. We now have a col- 
lection {p~!(Ua)}aca of subsets of E GE’ such that 


(i) E®E’ =Uaeap '(Ua): 
(ii) for any a, B € A, 


p~'(Ua)p~' (Ug) =p" (ap): 


which is again in the collection; 
(iii) for each a € A, there is a bijection 


ba: P~'(Us) + Ua xR; 
(iv) for each pair a, B € A, the map 
bao d_': op (P '(Uap)) > ba (p~' (Uap) 
is ahomeomorphism. 
We formalize this situation in a topological lemma. 


Lemma 20.13. Let C = {Sa}aea be a collection of subsets of a set S such that 


(i) their union is S; 
(ii) © is closed under finite intersections; 
(iii) for each a € A, there is a bijection 


ba: Sa — Ya, 


where Yq is a topological space; 
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(iv) for each pair «,B € A, ba(Sap) is open in Ye, where Syg = So, Sg, and the 
map 
Pa © op. : bp (Sap) => da (Sap) 
is a homeomorphism. 


Then there is a unique topology on S such that for each a € A, the subset Sq is open 
and the bijection og: Sq —> Yo, is a homeomorphism. 


The proof of this lemma is left as an exercise (Problem 20.1). Applying the 
lemma to the collection {p~!(Uqg)}aca of subsets of E 6 E’, we obtain a topology 
on E @ E’ in which each p~!(U,) is open and each 


ba: Pp '(Ua) + Ug xR” (20.5) 


is ahomeomorphism. Since the sets Ug x R’*” are homeomorphic to open subsets 
of R"+’*” and the transition functions @q op. are all C*, {(p~! (Ug), ba)} is aC” 
atlas on E @E’. Thus, E@E’ is aC” manifold. Moreover, the trivializations (20.5) 
show that p: E@ E’ > M isa C™ vector bundle of rank r+7’. 


20.7 Other Operations on Vector Bundles 


The construction of the preceding section can be applied to the disjoint union 
Lpem Ep @E, of the tensor product of fibers of E and E’. This produces a C” vector 
bundle E @ E’ of rank rr’ over M, whose fiber above p € M is the tensor product 
Ep @E,,. 

More generally, let V be the category whose objects are finite-dimensional real 
vector spaces and whose morphisms are isomorphisms, not merely linear maps, of 
vector spaces. In this category, if two vector spaces have different dimensions, 
then the set of morphisms between them is the empty set. Denote by V x V the 
category whose objects are pairs of finite-dimensional vector spaces and whose 
morphisms are pairs of isomorphisms of vector spaces. Let J: V x V > V be a co- 
variant functor that associates to a pair of finite-dimensional vector spaces (V,W) 
another finite-dimensional vector space J(V,W), and to a pair of isomorphisms 
(f: V > V',g: W > W’) an isomorphism T(f,g): T(V,W) > T(V’,W’). 

If V and V’ are finite-dimensional vector spaces of the same dimension n, let 
Iso(V,V’) be the set of all isomorphisms from V to V’. With respect to fixed bases 
for V and V’, elements of Iso(V,V’) are represented by nonsingular n x n matrices. 
Hence, Iso(V,V’) is bijective with GL(n,R), and therefore has the structure of a 
manifold as an open subset of R’*”. The functor J: V x V > V is said to be smooth 
or C® if for all finite-dimensional vector spaces V, V’, W, W’ with dimV = dimV’ 
and dimW = dimW’, the map 


J: Iso(V,V’) x Iso(W,W’) — Iso (T(V,W),T(V’,W')) , 
(f.8) > T(f,8), 


is smooth. 
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Example. Let T(V,W) = V @W. This is a smooth functor because if f: V > V’ and 
g: W — W’ are isomorphisms represented by matrices My and M, relative to some 
bases, then f @ g: V@W — V’ GW’ is represented by the matrix 


M; O 
O Mg, 
relative to the same bases. 
Mimicking the construction of the direct sum of two vector bundles, we obtain 
the following result. 


Proposition 20.14. [f J: Vx V — V is a C™ covariant functor, then for any two C~ 
vector bundles E and F over a manifold M, there is a C® vector bundle T(E, F) over 
M whose fiber at p € M is T(Ep, Fy). 


The same construction applies if J has any number of arguments. If a functor 
is contravariant, it must be turned into a covariant functor first to apply this con- 
struction. For example, consider the dual functor applied to a vector bundle E — M. 
Because of its contravariance, it associates to a trivialization 


Wp: Ep > {p} xR’ 


the map 


Wp: {Pp} x (R')" + E,, 


which has the wrong direction. We need to take the inverse of VW to get a trivializa- 
tion that goes in the right direction, 


(Wp) 1: Ey > {p} x (R')’. 


To construct the dual bundle, the functor J: V — V associates to every finite- 
dimensional vector space V its dual space V, and to every isomorphism f: V > W 
the isomorphism (f’)~!: VY > WY, 


In the category V the morphisms are all isomorphisms precisely so that one can 
reverse the direction of a map and make a contravariant functor covariant. In this 
way, Starting from C™ vector bundles F and F over M, one can construct C™ vector 
bundles E@F, E@F, \*‘ E, EY, and Hom(E,F) ~ EY ® F over M. 

In Section 10.4 we defined a Riemannian metric on a vector bundle E + M asa 
C~ assignment of an inner product (, )» on the fiber E, to each point p in M. Using 
the multilinear algebra and vector bundle theory from the preceding sections, we can 
be more precise about the kind of object a Riemannian metric is. 

An inner product on a real vector space V is first of all a bilinear map: Vx V > R. 
By the universal mapping property, it can be viewed as a linear map: V ® V —> R, or 
an element of Hom(V ®V,R). By Proposition 18.15, 


Hom(V @V,R) ~ (V@V)" ~VY @vVY. 
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Thus, an inner product on V is an element of VY @ VY which is positive-definite and 
symmetric as bilinear maps. 

A Riemannian metric on the vector bundle E — M associates to each point p € M 
an element of E ® E>; in other words, it is aC” section of EY @ EY satisfying the 
positivity and symmetry conditions. 


Problems 


20.1* Topology of a union 
Prove Lemma 20.13. 


20.2. Exterior power of a vector bundle 
For a smooth vector bundle E over a manifold M, give the details of the construction of Ne E 
that show it to be a smooth vector bundle over M whose fiber at p € M is A* (Ep). 


20.3. Tensor product of two vector bundles 
For two smooth vector bundles E and F over a manifold M, give the details of the construction 
of E ® F that show it to be a smooth vector bundle over M whose fiber at p € M is Ey ® Fy. 
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$21 Vector-Valued Forms 


A differential k-form @ on a manifold M assigns to each point p € M an alternating 
k-linear map 
Op: TpMx---xT,M—-R. 


If instead of R, the target space of the map @, is a vector space V, then @& is called a 
vector-valued form. Vector-valued forms arise naturally in differential geometry, for 
example, when one wants to define connections and curvature for a principal bundle 
as we do in Chapter 6. In preparation, we gather together in this section some basic 
constructions and properties of vector-valued forms. 

Even more generally, the target space of a, may be the fiber E, of a vector 
bundle E over M. In this case, a is a differential form with values in a vector bundle. 
We have already encountered such an object: from Section 10.3, the curvature of a 
connection on a vector bundle E — M can be seen as a 2-form with values in the 
endomorphism bundle End(£). 

Throughout this section, V, W, and Z denote finite-dimensional real vector 
spaces. 


21.1 Vector-Valued Forms as Sections of a Vector Bundle 


Let M be a manifold and p a point in M. By the universal mapping property for 
an exterior power, there is a one-to-one correspondence between alternating k-linear 
maps on the tangent space 7,,M and linear maps from rN TM to R. If Ay(Z,M) is 
the space of alternating k-linear functions on the tangent space 7,,M, then by (19.3) 
and Theorem 19.13, there are canonical isomorphisms 


Ac(TpM) = (/\' TM). ~ (Tem). 


Thus, aC” k-form on M may be viewed as a C™ section of the vector bundle \ T*M, 
and so the vector space of C™ k-forms on M is 


=1(('T'M). 


It is a simple matter to generalize the usual calculus of differential forms on a 
manifold to differential forms with values in a finite-dimensional vector space V, or 
V-valued forms for short. 

Let T* be the Cartesian product of k copies of a vector space T. A V-valued 
k-covector on T is an alternating k-linear function f: T* + V. By the universal 
mapping property of the exterior power A‘T (Theorem 19.6), to each V-valued 
k-covector f: T* —> V there corresponds a unique linear map f: A‘T > V such 
that the diagram 
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A‘T 


‘| XxX (21.1) 


T* —_+Y)V 


f 


commutes, and conversely. We denote the vector space of V-valued k-covectors on 
T by Ax(T,V). The one-to-one correspondence f +> f in (21.1) induces a linear 
isomorphism 


Nw k 
A;(T,V) % Home (A T,V) 


By two standard isomorphisms of multilinear algebra (Proposition 18.14 and Theo- 
rem 19.13), 


Home (A"7,v) ~(A‘r) ev ~(A‘T’) av. 


It is customary to write TY as T*. Thus, a V-valued k-covector on T is an element of 
the vector space (A T*) @V. 

A V-valued k-form on a manifold M is a function that assigns to each point 
p © M aV-valued k-covector on the tangent space 7,M; equivalently, it assigns to 
each p € M an element of (AK T;M) ®V. If E is a vector bundle over M and V is a 
vector space, the notation E ® V will mean the tensor product of the vector bundle 
E with the product bundle M x V — M. Then a V-valued k-form is a section of the 
vector bundle (AK T*M)@®V. We denote the space of smooth V-valued k-forms on 
M by 


k 
ok(M,V) =T ((A T'M) av) 
Let v1,...,v¥, be a basis for V, and @ a V-valued k-form on a manifold M. For any 
point p € M and tangent vectors u1,...,ug € TpM, since Op(u1,..., ug) is an element 


of V, it is uniquely a linear combination of v1,...,v, with real coefficients depending 
on p and wy,...,uz. We denote the coefficient of v; by o7,(u1,---, ux). Then 


Op (Uy ,-- +5 Mk) = YOU (uy, .. Ue) Vi. (21.2) 


Because @, is alternating and k-linear, so is ou, (ur, ...;Ux) for each i. Thus, at, is a 
k-covector with values in R. We can rewrite (21.2) as 


Cn (Ui ave) = > (CO, OV; Wy vsti) 


i 


(This notation is consistent with that of the tensor product of a k-linear function with 
0-linear function, i.e., with a constant.) As p varies in M, we see that every V-valued 
k-form a on M is a linear combination 


a= Ya’ avi, 
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where the a! are ordinary k-forms on M. We usually omit the tensor product sign and 
write more simply a@ = ¥.a'v;. A k-form is also called a form of degree k. A form 
is homogeneous if it is a sum of forms all of the same degree. The V-valued k-form 
a is said to be smooth if the coefficients a! are smooth for all i= 1,...,n. It is 
immediate that this definition of smoothness is independent of the choice of basis. 


21.2 Products of Vector- Valued Forms 


Let V, W, Z be finite-dimensional vector spaces and uw: V x W > Z a bilinear map. 
One can define a product of a V-valued covector and a W-valued covector on a vector 
space T, 


Au(T,V) x Ae(T,W) + Anye(T,2Z); 


by the same formula as the wedge product of two scalar forms, with the bilinear map 
L replacing the multiplication of real numbers: 


(a-B)(t,--- tee) 
1 
= a DY (sgno)H (Ete); --- sto)» Bltotettys--sto(eto)) (21.3) 


CESK +e 
for @ € Ay(T,V) and B € Ap(T,W). 


This formula generalizes the wedge product of scalar covectors. The same proofs 
as in the scalar case [21, Section 3.7, pp. 26—27] show that 


(i) a B is alternating and multilinear in its argument, and hence is a (k+ ¢)-covector 
on T; 

(ii) instead of summing over all permutations in S;4¢, one may sum over (k, ¢)- 
shuffles: 


(@- B)(t,...,tere) 


— x (sgno)u (Olfotitin Fath) Vsmayaxctatia)): (21.4) 
(k,€)-shuffles 
oO 


(By Definition 19.14, a (k, 2)-shuffle is a permutation o in S;1,¢ such that 
o(1) <---< o(k) and o(k+1) <--:< o(k+2).) 


This a -B is a product in the sense of being R-linear in @ and in f. It is neither 
graded-commutative nor associative, even when V = W = Z. 

When applied pointwise to a manifold, with T = 7,,M, the product (21.3) gives 
rise to a bilinear map of vector-valued forms 


Q*(M,V) x Q'(M,W) > Q***(M,Z). 
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Proposition 21.1. Let v; be vectors in a vector space V and let w; be vectors in 
a vector space W. Suppose U: V xX W > Z is a bilinear map, and a = Ya'v; € 
Q*(M,V) and B = > B/w; € Q'(M,W) are vector-valued forms, where ot! and B/ 
are R-valued forms on a manifold M. Then 


a-B= Xa! AB) (vi, w;) € OF (M,Z). 


Proof. Fix p € M and uy,...,Uz4¢ € TM. Then 


(@-B)p(u1,.--,Ue+e) 
1 


= aA Y (sano) et (Op (ue 1)s«-+s4o())s Bp(Uo(e+t) «++ 4o(k+0))) 
OES +E 
te 
ke! oes 
(sgno) oe (ue(1); Ug(k)) vis > Bp (ws(e41)>-++Mo(e+0) i) 
j 


YX (sgno)ats (ug (1),--- Uo) BE Uocert)s--- Ho(e+o) H(Vi, wy) 


CESK +E 


= dl AB!) p(ur yee Use) M(Vi, Wi): 


To show that a A B is smooth, we may take {v;} and {w;} to be bases for V and 
W, respectively. Then a! and B/ are smooth by definition. Let {zy} be a basis for Z 
and suppose 


M(viows) = Dete 


By the formula just proven, 


a-B =>¥ (Ye A Bi) ane 


k Ci 


which shows that a - B is smooth. 


Example 21.2. An m x n matrix-valued form is an element of Q*(M,R”*”). Let 
ej; © R”*? be the matrix with 1 in its (i, )-entry and 0 in all other entries, and let 
é;; € R?*” and é;; € R”*” be defined similarly. If 


LU: R@XP x REX” _y Rmxn 
is matrix multiplication, then 


H(eij,€xe) = eijexe = Sjr€ie. 
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For a@ = Laie; € O*(M,R”*?) and B = Y BM ee € Q*(M,R?*"), by Proposi- 
tion 21.1, 
a- B= al A BM u(e;;, ee) 
ay arn O jee 
=> (Dat rB") eu = as, 
il Sk 


the wedge product of matrices of forms defined in Section 11.1. 


21.3 Directional Derivative of a Vector- Valued Function 


If f: M — R is a smooth function on the manifold M and X, is a tangent vector to M 

at p, then we may interpret X, f as the directional derivative of f at p in the direction 

Xp; indeed, for any smooth curve c(t) in M with initial point c(0) = p and initial 

vector c'(0) = Xp, 

Xpf= < f(c(t)). (21.5) 
t=0 


To extend the definition of X,f to a smooth vector-valued function f: M— V, 
choose a basis vj,...,V, for V and write f = >’ f'v; for some smooth real-valued 
functions f’: M — R. Define 


Xpf = V(X vi. (21.6) 
It is a routine exercise to show that the definition 21.6 is independent of the choice of 
basis (Problem 21.2). It follows that for a smooth vector-valued function f: M— V 
and the same smooth curve c(t) in M as in (21.5), 


Xpf = Xf vi (by (21.6) 
-X(4 f(el))% (by 215) 


_d 
dt 


t=0 


YS fi(c(t))vi (by (21.6)) 


t=0 i 


d 

= c(t)). 

af) 

This shows that for a vector-valued function f, one can still interpret X,f as the 
directional derivative of f at p in the direction Xp. 


21.4 Exterior Derivative of a Vector-Valued Form 


As before, M is a manifold and V is a finite-dimensional vector space. To define the 
exterior derivative of a V-valued k-form a € Qk (M,V), choose a basis v1,...,V, for 
V, write & = > av; with a! € Q*(M), and define 


da = ¥\(da')y;. (21.7) 
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It is easy to check that so defined, the exterior derivative da is independent of the 
choice of basis (Problem 21.3). This definition is consistent with the definition of the 
exterior derivative of a matrix-valued form in Section 11.1. 


Proposition 21.3. The exterior derivative on vector-valued forms on a manifold M is 
an antiderivation: if « € Q'(M,V), B € Q'(M,W), and uw: V x W — Z is a bilinear 
map of vector spaces, then 


d(a-B) = (da)-B + (—1)*8%a-dp. 


Proof. The easiest way to prove this is to write @ and f in terms of bases for V and 
W, respectively, and to reduce the proposition to the antiderivation property of d on 
the wedge product of ordinary forms. 

To carry this out, let {vj} be a basis for V and {wj} a basis for W. Then @ = 
Ya'v; and B = > B/w; for ordinary forms a’, B/ on M. By Proposition 21.1, 


a-B = Zo! BAW). 
Then 
gp eC (by (21.7)) 
_¥ ((der) 1B) u(vi0n)4 (—1)*8°'S" (ae! A dB!) w(vi,W9) 


i,j ij 
(antiderivation property of d on scalar forms) 
= ¥ (da')vj- ¥ Bw; + (-1) 88" ¥ avi ¥ (dB! )w; 


(Proposition 21.1) 
= (da) -B + (-1)*8%a-dB. 


If @ and f are matrix-valued forms on a manifold, say of size m x p and p xn, 
respectively, and 
LU: R@XP yx RPX” _y Rmxn 


is matrix multiplication, then by Example 21.2, a-B = aA, and Proposition 21.3 
becomes the antiderivation formula for the matrix wedge product: 


d(a AB) = (da) AB +(-1)**%a AaB. (21.8) 


21.5 Differential Forms with Values in a Lie Algebra 


In the cases of special interest to us, V is either a Lie algebra g or more specifically 
the matrix algebra gl(n,R). If u: g x g > g is the Lie bracket, then we write [ar, B] 
instead of a - B and call it the Lie bracket of the two g-valued forms a and B. For 
a € OF (M,g), B € Q'(M,g), p EM and wy,...,ue40 € TM, 
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[@, Bl p(u1,---,Ue+e) 


= ». (sgn) [Op (Ug (1)1-+-Mo(k))> Bp Uo(k-+1)>+++>Mo(k+e))]- (21.9) 
(k,¢)-shuffles 
oO 


For example, if o and B are g-valued 1-forms on a manifold M, and X and Y are C® 
vector fields on M, then by (21.9), 


[a, B](X,Y) = [a(X),B(Y)|—[a(¥), B(X)]. (21.10) 


In practice it is usually easier to calculate [a, B] using a basis. For V = g, Proposi- 
tion 21.1 assumes the following form. 


Proposition 21.4. Suppose {Aj,...,An} is a set of vectors in a Lie algebra g and the 
forms o € Q*(M,g) and B € Q'(M,g) can be written as o& = ¥,a'A; and B => B/Aj. 
Then 

(ar, B] = D(a AB’) [Ai,Aj] € O*"(M, 9). (21.11) 


ij 


Note that in this proposition {A1,...,A,} is an arbitrary set of vectors in g; it need 
not be a basis. 


Proposition 21.5. [f a is a g-valued k-form and B a g-valued ¢-form on a manifold, 
then 


[o, B] = (—1)"*"[B, ay). 


Proof. Let Bi,...,Bn be a basis for g. Then @ = > a'B; and B = > B/B; for some 
R-valued forms o' and B/. Then 


[@, B] = D(a" A B’) [BiB] 
ij 
= (-1)81 > (8! Aa) B;,Bil 
ij 
(since A is graded commutative and [ , | is skew) 


= (-1)""[B, a). 


Therefore, if @ and B are g-valued 1-forms, then [o, 8] is symmetric in o and B 
and [o, 0] is not necessarily zero. For the Lie bracket of g-valued forms, Proposi- 
tion 21.3 can be restated as follows. 


Proposition 21.6. The exterior derivative d on Q*(M,g) is an antiderivation with 
respect to the Lie bracket: if & and B are homogeneous g-valued forms on a manifold 
M, then 


dlc, B] = [dar B] + (—1)"8*[a, dB). 
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Differential Forms with Values in gl(n,R) 


On the Lie algebra gl(n,R) of n x n real matrices, there are two natural bilinear 
products, matrix multiplication and the Lie bracket. They are related by 


[A,B] = AB-—BA for A,B € gl(n,R). 
Let e;; be the n by n matrix with a 1 in the (i, j) entry and 0 everywhere else. Then 
€1jeKe = Sjxeit, 


where 6 jz is the Kronecker delta. 

For two homogeneous gl(n,R)-valued forms @ and B on a manifold M, we de- 
note their products induced from matrix multiplication and from the Lie bracket by 
a B and [a, B], respectively. In terms of the basis {e;;} for gl(n,R), 


a=) ov ey, B => BM ex 


for some real-valued forms ’/ and B on M. By Proposition 21.1 the wedge product 
of @ and B is 


a\p= By al NB e:sexe = pS a a BE cig, 
Like ike 


while their Lie bracket is 
[a, B] = oO AB lei, exe]. 


Proposition 21.7. If « and B are homogeneous g\(n,R)-valued forms on a manifold 
M, then 


(a, B] = & AB — (—1) 2 OP) B A a, 
Proof. Problem 21.4. 


Hence, if a is a homogeneous gl(n,R)-valued form on a manifold, then 


2 if d is odd 
jnl={ aAa if dega is odd, (21.12) 


0 if deg & is even. 


21.6 Pullback of Vector-Valued Forms 


Like scalar-valued forms, vector-valued forms on a manifold can be pulled back by 
smooth maps. Suppose V is a vector space, a a C” V-valued k-form on a manifold 
M, and f: N—+MaC® map. The pullback f* a is the V-valued k-form on N defined 
as follows: for any p € N and uy,..., uz, € T)N, 


(f* OL) p(ui,.-- UK) = Obf(py (falls, --- fallk)- 


The pullback of vector-valued differential forms offers no surprises; it satisfies the 
same properties as the pullback of scalar-valued differential forms. 
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Proposition 21.8. Suppose u: V x W = Z is a bilinear map of finite-dimensional 
vector spaces, and f : N — M is a smooth map of manifolds. 


(i) If {v1,...,Vn} is a set in the vector space V and a = ¥,a'v; for a! € QF(M), 
then fra = >(f*a')v. 

(ii) The pullback f* of vector-valued forms commutes with the product: if a € 
Q*(M,V) and B € Q'(M,W), then 


f°(a- B) = (frat): (F°B). 


(iii) The pullback f* commutes with the exterior derivative: for a € Q*(M,V), 
fida=df*a. 


Proof. All three properties are straightforward to prove. We leave the proof as an 
exercise (Problem 21.6). 


In particular, if U: g x g — g is the Lie bracket of a Lie algebra g, then 


f*[a, B) = [fr o, °B) 


for any g-valued forms a, 8 on M. 


21.7 Forms with Values in a Vector Bundle 


For a vector space V, a V-valued k-form on a manifold M assigns to each point p € M 
an alternating k-linear map 


T,)M x ++» x T,M > V. 


More generally, we may allow the vector space V to vary from point to point. If E 
is a vector bundle over M, then an E-valued k-form assigns to each point p in M an 
alternating k-linear map 


TpM x---X TpM — Ep. 
Reasoning as above, we conclude that an E-valued k-form is a section of the vector 
bundle ( j* T*M) @E. The space of smooth E-valued k-forms on M is denoted by 


QK(M,E) := r((A° T'M) @E). 


Example 21.9 (The curvature of a connection). Let E — M be a vector bundle with 
a connection V. In Section 10.3 we saw that the curvature of the connection assigns 
to each point p € M an alternating bilinear map 


TM X TyM — End(Ep). 


Thus, the curvature is a section of the vector bundle (A? 7*M) @End(E). It is in fact 
a C™ section (Problem 21.7). As such, it is a smooth 2-form on M with values in the 
vector bundle End(£). 


21.8 Tensor Fields on a Manifold 195 


21.8 Tensor Fields on a Manifold 


Vector fields and differential forms are examples of tensor fields on a manifold. 


Definition 21.10. An (a,b)-tensor field on a manifold M is a section of the vector 
bundle 


(@'TM) @ (QTM) =TMa-.-@TM@T'Me--@T'M, 
SS OO 
a b 


where the tangent bundle TM occurs a times and the cotangent bundle T*M occurs 
b times. 


Example. A Riemannian metric on M is a section of T*M @ T*M, so it is a (0,2)- 
tensor field. 


Example. Asa section of the tangent bundle TM, a vector field on M is a (1,0)-tensor 
field. 


Example. A differential k-form on M associates to each point p € M an alternating 
k-linear map 


T)pM x---x T>M > R. 


By the universal mapping theorem, a k-linear map on 7,,M corresponds to a linear 
map 


TM ®---®T,M > R. 


Hence, a k-form is a section of @*T*M corresponding to an alternating map. In 
particular, it is a (0,k)-tensor field. (A k-form is also a section of A‘ T*M > M,) 


More generally, if E is a vector bundle over M, then an E-valued (a,b)-tensor 
field is a section of 
b 
(@'7M) @(@'T'm) @E. 


Example. By Example 21.9, the curvature of a connection on a vector bundle E — M 
is a section of the vector bundle 


Hom(TM ®TM,End(E)) ~ (TM@TM)Y @End(E) (by Prop. 18.14) 
~T*M®T*M®End(E) (by Prop. 18.15). 


As such, it is an End(£)-valued (0,2)-tensor field on the manifold M. 
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21.9 The Tensor Criterion 


A tensor field T € br (M )) on a manifold M defines an F-multilinear function 


Q!(M)* x ¥(M)? > F, 
(@1,.--,@a,M1,---,Yp) +> T(@,.--,@a,M,---,Yp). 


Conversely, the following proposition gives a condition for a multilinear function to 
be a tensor field. 


Proposition 21.11 (The tensor criterion). There is a one-to-one correspondence 
between 


{ ¥- multilinear functions T :Q'(M)4 x %(M)? + ¥} 


and 
{ tensor fields T € T(T*?(M))} 


such that 
PO; 35 Op, Vinee eB) = Dy Oi ser Daw Vipers ope 


Because of this proposition, we often identify an F-multilinear function T with 
the tensor field T that it corresponds to. 


Proof. We give the proof only for (a,b) = (2,1), since the general case is similar. 
Suppose 
T: Q'(M) x Q!(M) x X(M) > F 


is 3-linear over F. Since Q'(M) =T(T*M) and X(M) =T(TM), by Proposi- 
tion 7.28, for each p € M there is a unique R-multilinear map 


Ty: TSM x T3M x TpM > R 
such that for all (@,@2,Y) €T(7*M) x T(T*M) x T(TM), 
T)(@1,p,@2,p,Yp) a T (@1,@2,Y)(p). 


By the universal property of the tensor product, the multilinear map 7, corresponds 
to a unique linear map 7, : T;M®T,M ®T,M — R such that 


Tp (O1,p ® @2,p @Yp) = T(@1,@2,Y)(p). 
In other words, qT, is an element of 


Hom(T;M @T;M ®T,M,R) ~ (T;M®T;M@T,M)Y 
=1,M®T,M®T,M_ (by Prop. 18.15). 


Thus, 7 is a section of T7!(M) =TM@TM@T*M. 
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21.10 Remark on Signs Concerning Vector- Valued Forms 


Formulas involving vector-valued forms often have additional signs that their scalar- 
valued or constant analogues do not have. For example, the Lie bracket is anticom- 
mutative on elements of a Lie algebra g: 


[A,B = —|B,A], 
but when applied to g-valued forms, 
[or, B] = (—1) re 08?) +118, ar). (21.13) 


There is a general rule that appears to describe the additional sign or at least to 
serve as a useful mnemonic: whenever two graded objects x and y are interchanged, 
the additional sign is (—1)*°8*4°8”, This applies to example (21.13) above. It also 
applies to Proposition 21.7: For A,B € gl(n,R), 


[A,B] = AB— BA, 
but for gl(n,IR)-valued forms, 


(or, B] = aA B — (—1) #842 B Aa. 


If we assign a degree of 1 to the exterior derivative d, then the antiderivation 
property in Proposition 21.3 also fits this pattern. 


Problems 


21.1. Associativity of the product of vector-valued forms 
Suppose 1: V x W — Z is a bilinear map of vector spaces, A € Q'(M), a € Q4(M,V), and 
B €.Q°(M,W). Prove that 


(Act) -B =A-(a-B) = (-1)"a- (A-B). 


(Hint: Write @ and B in terms of a basis for V and a basis for W, respectively, and apply 
Proposition 21.1.) 


21.2. Directional derivative of a vector-valued function 
Show that the definition (21.6) of the directional derivative is independent of the choice of 
basis. 


21.3. Exterior derivative of a vector-valued function 
Prove that the definition (21.7) of the exterior derivative is independent of the choice of basis. 


21.4. Lie bracket in terms of the wedge product 
Prove Proposition 21.7. 


21.5. Triple product of 1-forms 
Show that if @ is any g-valued 1-form on a manifold, then 


[[o, &], «] = 0. 
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21.6. Pullback of vector-valued forms 
Prove Proposition 21.8. 


21.7. Curvature as a section 
Let V be a connection on a smooth vector bundle 7: E — M. Show that the curvature of V is 
a smooth section of the vector bundle ( he T*M) @End(E). 


21.8. Exterior derivative of a vector-valued 1-form 
Let V be a vector space. Prove that for a C® V-valued 1-form @ and C™ vector fields X,Y 
on M, 

(d@)(X,Y) =Xa@(Y) —Yo(X) — @([X,Y]). 


(Hint: See [21, Th. 20.14, p. 233].) 


21.9.* Maurer-Cartan equation 
The Maurer—Cartan form on a Lie group G with Lie algebra g is the unique g-valued 1-form 
6 on G that is the identity map at the identity element e ¢ G. Thus, for X, € 7,G and Xz, € T,G, 


0(Xe) =Xe Eg 
and 
Oo (Xe) = (f-1 Oc) (Xg) = Oe(ly-1.Xg) = fy-14Xg. 
Prove that the Maurer—Cartan form satisfies the Maurer—Cartan equation 
1 
d0+ 518.4] =0. 
(Hint: By F-linearity, it suffices to check this equation on left-invariant vector fields.) 


21.10. Right translation of the Maurer—Cartan form 
Prove that under right translation the Maurer—Cartan form @ on a Lie group G satisfies 


r,0=(Adg')@ forgeG. 


Chapter 5 


Vector Bundles and Characteristic Classes 


The Gauss—Bonnet theorem for a compact oriented Riemannian 2-manifold M may 
be stated in the following way: 


1 
ia 55K vol =7(M), 


where K is the Gaussian curvature. What is especially significant about this theorem 
is that on the left-hand side the 2-form K vol is locally the curvature form Q} rela- 
tive to an orthonormal frame of the Riemannian metric, but on the right-hand side the 
Euler characteristic y(M) is a diffeomorphism invariant, independent of the Rieman- 
nian structure. Thus, the Gauss—Bonnet theorem for surfaces raises two interesting 
questions about higher-dimensional compact oriented manifolds: 


(i) Is there a differential form whose integral over M gives the Euler characteristic? 
(ii) Is it possible to construct from the curvature tensor diffeomorphism invariants 
other than the Euler characteristic? 


Let H'(M) be the de Rham cohomology vector space of M in degree i. The ith 
Betti number b; of M is the dimension of H'(M). In algebraic topology one learns 
that the Euler characteristic of a compact manifold may be computed as the alternat- 
ing sum of the Betti numbers [18, Th. 22.2, p. 124]: 


dimM : 
x(M) =D) (-1)'Bi. Cy 
i=0 
If the manifold is not compact, this may be an infinite sum and the Euler character- 
istic need not be defined. 
Furthermore, the Poincaré duality theorem for a compact orientable manifold 
asserts that the integral of the wedge product of forms of complementary dimensions 


H'(M) x H"-'(M) S H"(M) 4R, 


((ol, fe) f one, 
M 
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is a nondegenerate pairing [3, p. 44]. Hence, by Lemma 19.11, there is a linear 
isomorphism of H'(M) with the dual of H"~'(M): 
H'(M) ~ H""(M)Y 
and 


b! = pri, 


Example. If M is a compact orientable manifold of dimension 5, then its Betti num- 


bers pair up: 
bo=bs, bh=ba, bo =b3 


and so 


4(M) = bo — by + bo — b3 + bg — bs 
= (bo — bs) + (bz — b3) + (ba — 1) 
=0. 


This example generalizes to any odd-dimensional compact orientable manifold. 


Proposition. The Euler characteristic of a compact orientable odd-dimensional 
manifold is 0. 


Proof. If n= dimM 1s odd, then the Betti numbers b; and b,_; are equal by Poincaré 
duality. Moreover, they occur with opposite signs in the alternating sum for the Euler 
characteristic. Hence, all the terms in (*) cancel out and y(M) = 0. 


Thus, from the point of view of generalizing the Gauss—Bonnet theorem by an- 
swering question (i), the only manifolds of interest are compact oriented Riemannian 
manifolds of even dimension. 

Following Chern and Weil, we will associate to a vector bundle EF over M global 
differential forms on M constructed from the curvature matrix of a connection EF. 
These forms turn out to be closed; moreover, their cohomology classes are indepen- 
dent of the connection and so are diffeomorphism invariants of the vector bundle E. 
They are called characteristic classes of E. Specializing to the tangent bundle TM, 
we obtain in this way new diffeomorphism invariants of the manifold M@, among 
which are the Pontrjagin classes and Pontrjagin numbers of M. 
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This section collects together some facts about connections and curvature on a vector 
bundle that will be needed in the construction of characteristic classes. We first 
study how connection and curvature matrices transform under a change of frame. 
Then we derive the Bianchi identities, which are formulas for the exterior derivative 
of the connection and curvature matrices. Just as there are two structural equations, 
corresponding to the torsion vector and the curvature matrix, so there are two Bianchi 
identities, corresponding to the exterior derivatives of the two structural equations. 
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22.1 Connection and Curvature Matrices Under 
a Change of Frame 


In Section 10.6 we showed that a connection on a vector bundle E —> M may be 
restricted to a connection over any open subset U of M: 


Vv": £(0) xT(U,£) 31 (U,£). 


We will usually omit the superscript U and write V for V!. 
Suppose there is a frame e),...,e, for E over U. Then the connection matrix 
of V relative to the frame e),...,e, over U is the matrix [ai] of 1-forms defined by 


Vxej = ¥ 0} (X)e;, 


for X € X(U). If we write the frame e),...,¢e, as a row vector e = |e, --- e;], then in 
matrix notation 


Vxe=eo(X). 


As a function of X, 
Ve=eo. 


Similarly, for C® vector fields X,Y on U, the curvature matrix © of the connec- 
tion V is the matrix [Q'] of 2-forms defined by 


R(X, Y)ej = YO'(X,V)e;. 


We next study how the connection and curvature matrices @ and Q transform 
under a change of frame. Suppose @),...,é@, is another frame for E over U. Let 
oO= [a5] and Q = [Q'] be the connection and curvature matrices of the connection 
V relative to this new frame. At each point p, the basis vector é(p) is a linear 
combination of e;(p),...,e-(p): 


2(p) = Yai (p)ex(p). 


As sections on U, 
7 k 
ee = y Apex. 


So we get a matrix of functions a = [ak]. As the coefficients of a smooth section with 
respect to a smooth frame, the a are smooth functions on U. At each point p, the 
matrix a(p) = [ak (p)] is invertible because it is a change of basis matrix. Thus we 
can think of a as a C® function U — GL(7,R). As row vectors, 
k 

|= [er -- erllag), 
or in matrix notation, 

é=ea. (22.1) 


Since in matrix notation (22.1), the matrix of functions a, must be written on the 
right of the row vector of sections e;, we will recast the Leibniz rule in this form. 
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Suppose s is a C™ section of the vector bundle EF and f is a C™ function on M. By 
the Leibniz rule, for any C® vector field X € X(M), 


= (Vxs) f +sdf(X). (22.2) 


We may view Vs as a function X(M) — T(E) with (Vs)(X) = Vxs. Then we may 
suppress the argument X from (22.2) and rewrite the Leibniz rule as 


V(sf) =(Vs)f+sdf forsET(E), feC°(M). (22.3) 


Theorem 22.1. Suppose e and é are two frames for the vector bundle E over U such 
that é = ea for some a: U — GL(r,R). If @ and @ are the connection matrices and 
Q. and Q are the curvature matrices of a connection V relative to the two frames, 
then 


(i) @ =a !wa+a-'da, 
(ii) Q =a! Qa. 


Proof. (i) We use the Leibniz rule (22.3) to derive the transformation rule for the 
connection matrix under a change of frame. Since é = ea, we have e = éa—!. Recall 
that if a = [a’ is a matrix of C~ functions, then da is the matrix of C™ 1-forms [da’]. 
Thus, 


Vé = V(ea) 
= (Ve)a+eda (Leibniz rule) 
= (e@)a+eda (Ve=eo) 


=éa'watéa'da (e=éa') 


=é(a'@a+a'da). 


Therefore, 
@=a 'wat+a ‘da. 


(ii) Since the structural equation (Theorem 11.1) 
Q=do+@A@ (22.4) 


expresses the curvature Q in terms of the connection matrix @, the brute-force way 
to obtain the transformation rule for Q is to plug the formula in part (i) into (22.4), 
expand the terms, and try to write the answer in terms of Q. 

Another way is to note that since for any X,,¥, € T,M, the curvature R(X,,Y,) 
is the linear transformation of E, to E, with matrix [Q/(X),Y))] relative to the basis 
€1,...,e, at p, a change of basis should lead to a conjugate matrix. Indeed, for 
X,Y E€X(U), 


R(X, Y)ej = YQi(X,Y ei, 
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or in matrix notation, 
R(X,Y)e=R(X,Y)[e1,..-,e] = [e1,.-- er] [Q5(X,¥)] = eQ(X,Y). 


Suppressing X,Y, we get 


R(e) = eQ. 
Hence, 
R(é) = R(ea) 
=R(e)a (R is F-linear in the argument e) 
= eQa 
=éa 'Qa. 


This proves that Q = a~!Qa. 


22.2 Bianchi Identities 


When E — M is the tangent bundle, a connection on 
E is an affine connection as defined in Section 6.1. 
For an affine connection V, the torsion 


T (X,Y) =Vx¥ —VyX — [X,Y] 


is defined and is a tensor. If e),...,é, is a frame for 
the tangent bundle TM over U, then the torsion forms 
T' are given by 


TRY) => FY ex 


We collect t!,...,7” into a column vector t = [t'] 
of 2-forms. Similarly, if @!,...,@” are the 1-forms 
dual to the basis e1,...,é,, then @ is the column vec- 
tor [6']. Recall the two structural equations (Theo- Luigi Bianchi 


rem 11.7): (1856-1928) 


(1) t=d0+Q@A0, 

(2) Q=da+aNa. 
Proposition 22.2 (First Bianchi identity). Let V be a connection on the tangent 
bundle TM of a manifold M. Suppose 9 and Tt are the column vectors of dual 


1-forms and torsion forms, respectively, on a framed open set, and @ and Q are 
the connection and curvature matrices respectively. Then 


dtT=QAO—-OAT. 
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Proof. Differentiating the first structural equation gives 


dt =d(d0)+daA\9—@Adé@ 
=04+(Q-@A@)AO—@A(tT-—@/NB8) 
(rewriting d@ and d@ using the two structural equations) 
=QAA—@AT (since @\ @A/ 8 cancels out). 


Like the second structural equation, the next two identities apply more generally 
to a connection V on any smooth vector bundle, not just the tangent bundle. 


Proposition 22.3 (Second Bianchi identity). Let V be a connection on a smooth 
vector bundle E. Suppose @ and Q are the connection and curvature matrices of V 
relative to a frame for E over an open set. Then 


dQ =QA0-@AQ. 
Proof. Differentiating the second structural equation gives 


dQ = d(d@)+ (dw) \a—-aAda 
=04+(Q-@A0)A@—-—@A(Q-@A0) 
(rewriting d@ using the second structural equation) 
=QA0-@AQ. 


We will use the notation Q* to mean the wedge product Q/ --- A Q of the curva- 
ture matrix k times. 


Proposition 22.4 (Generalized second Bianchi identity). Under the same hypot- 
heses as in Proposition 22.3, for any integer k > 1, 


d(Q*) =2% Na-—wAQ. 


Proof. Problem 22.3. 


22.3 The First Bianchi Identity in Vector Form 


For a Riemannian connection, the torsion form T is zero and the first Bianchi identity 
(Proposition 22.2) simplifies to 
QA =0. 


This identity translates into a symmetry property of the curvature tensor. 


Theorem 22.5. [f X,Y,Z are C™ vector fields on a Riemannian manifold M, then the 
curvature tensor of the Riemannian connection satisfies 


R(X,Y)Z+R(Y,Z)X +R(Z,X)¥ =0. 
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Proof, Let e1,...,€, be a frame over an open set U in M, and @!,...,0” the dual 
1-forms. On U, Z=);; 6/(Z)e;. In the notation of the preceding section, 


R(X,Y)Z = ¥ R(X, Y)0!(Z)e; 
j 


= ¥0'(X,Y)0!(Z)e;. 
ij 
The ith component of QA 0 is > f Qi A 6/. Thus, for each i, the first Bianchi 
identity gives 


0= UQiA 0/)(X,Y,Z) 


= D5 (X,Y) 0! (Z) — Q5(X,Z)0/(Y) + Qi(Y,Z)6/(X). 


Multiplying by e; and summing over i, we obtain 


0 =R(X,Y)Z+R(Z,X)¥ + R(Y,Z)X. 


22.4 Symmetry Properties of the Curvature Tensor 


With the first Bianchi identity, we have now proven three symmetry properties of the 
curvature tensor: Let X,Y,Z,W be C™ vector fields on a Riemannian manifold. Then 
the curvature tensor of the Riemannian connection satisfies 


(i) (skew-symmetry in X and Y) R(X,Y) =—R(Y,X). 
(ii) (skew-symmetry in Z and W, Proposition 12.5) 


(iii) (first Bianchi identity) 
R(X, Y)Z+R(Y,Z)X + R(Z,X)Y =0. 


As an algebraic consequence of these three properties, a fourth symmetry prop- 
erty follows. 


Theorem 22.6. IfX,Y,Z,W are C™ vector fields on a Riemannian manifold, then the 
curvature tensor of the Riemannian connection satisfies 


(R(X,Y)Z,W) = (R(Z,W)X,Y). 
Proof. By the first Bianchi identity, 
(R(X, Y)Z,W) + (R(Y,Z)X,W) + (R(Z,X)Y,W) =0. (22.5) 


To cancel out the term (R(Y,Z)X,W) in (22.5), we add to (22.5) the first Bianchi 
identity starting with the term (R(Y,Z)W,X): 


(R(Y,Z)W,X) + (R(X,W)Y,X) + (R(W,Y)Z,X) =0. (22.6) 
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Similarly, to cancel out the term (R(Z,X)Y,W) in (22.5), we add 

(R(Z,X)W,Y) + (R(X,W)Z,Y) + (R(W,Z)X,Y) =0. (22.7) 
Finally, to cancel out (R(W,Y)Z,X) in (22.6), we add 

(R(W,Y)X,Z) + (R(Y,X)W,Z) + (R(X,W)Y,Z) =0. (22.8) 


Adding up the four equations (22.5), (22.6), (22.7), (22.8) and making use of the 
skew-symmetry properties, we get 


2(R(X,Y)Z,W) —2(R(Z,W)X,Y) =0, 


which proves the theorem. 


22.5 Covariant Derivative of Tensor Fields 


A connection V on the tangent bundle TM of a manifold M induces a covariant 
derivative on all tensor fields. This is the content of the following proposition. 


Proposition 22.7. Let V be a connection and X,Y C® vector fields on the mani- 
fold M. 


(i) If @ is aC” 1-form on M, then Vx @ defined by 
(Vx@)(Y) :=X(@(Y)) — @(VxY) 


isa C®™ I-form on M. 
(ii) If T is a C® (a,b)-tensor field on M, then VxT defined by 


(VT )(@1s <5 Oa, Figs+,¥o) SE (TF (@1y.+45.0a,Viy0s05¥5)) 


a 

= ST Oise V x Oicccle Vises Yh) 
b 

= y T(Q1,..-,@a,V1,...,VxYj,---Yo), 


for @; € Q'(M), Y; € X(M), is a C® (a,b)-tensor field on M. 


Proof. By the tensor criterion (Proposition 21.11), it suffices to check that Vx @ and 
VxT are F-linear in its arguments, where F := C”(M) is the ring of C® functions 
on M. Let f EF. 


(i) By definition and the Leibniz rule in Y of a connection, 
(Vx@)(fY) =X(fo(Y)) — (Vx fY) 


= (Xf)o(Y) + fX(@(Y)) —@((Xf)Y) — fo(Vx¥) 
= f(Vx@)(Y). 
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(ii) Replace @; by f@; in the definition of Vxy7. On the right-hand side, we have 


SOT Gig POR e.g Vine oig Yn) 
= (Xf)T (1,...,@a,V1,..-,¥p) + fX(T(@1,.--;@a,¥1,---,¥5)) (22.9) 


and 


T(@1,...,Vxf@;,.--,@a,M1,---,Yb) 


=T(@,...,(Xf)Q@;,---,@a,%1,---, Yo) + fT (@1,...,Vx@;,---@a,%,---, Yb). 
(22.10) 


The first terms of these two expressions cancel out. All the other terms on the 
right of (22.9) and (22.10) are F-linear in the ith argument. 

Similarly, the right-hand side of the definition of Vy7 is F-linear in the argu- 
ment Yj. 


On C™ functions f, we define Vx f = Xf. 
Let T“9(M) = (@" TM) ® (@° T*M) be the bundle of (a,b)-tensors on M. 


Theorem 22.8. Let V be a connection on a manifold and X € X(M). The covariant 
derivative Vx : T( Depo T*>(M)) +1 (@2,-9 T%?(M)) satisfies the product rule 
for any two tensor fields T, , Th: 


Vx(T ® Py) = (VxTi) ®@h+T, @Vxh. 


Proof. Problem 22.4. 


22.6 The Second Bianchi Identity in Vector Form 


If V is a connection and R(X,Y) is its curvature endomorphism on a manifold, then 
we define the Riemann curvature tensor or simply the curvature tensor to be 


Rm(X,Y,Z,W) = (R(X,Y)Z,W), X,¥,Z,W © X(M). 


The second Bianchi identity is obtained by differentiating the curvature. In vector 
form, this means taking the covariant derivative of the curvature tensor Rm. 


Theorem 22.9 (The second Bianchi identity in vector form). On a Riemannian 
manifold, if a connection V is compatible with the metric, then for all X,Y,Z,V, 
WeEX(M), 
(Vx Rm)(Y,Z,V,W) =0. 
cyclic in X,Y,Z 


(This notation means we cyclically rotate X, Y,Z to obtain two other terms.) 
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Proof. Since (Vx Rm)(Y,Z,V,W) is F-linear in all of its argument, it suffices to ver- 
ify the identity at a point p € M and for X,Y,Z,V,W a frame of C™ vector fields near 
p. Such a frame is the coordinate frame 01,...,0, of a normal coordinate neighbor- 
hood U of p. The advantage of using a normal coordinate system is that VyY = 0 
at p and [X,Y] =0 on U for any coordinate vector fields X,Y, ie. X = 0;, Y =0j 
for some i, j (Theorem 15.4). This greatly simplifies the expression for the covariant 
derivative of a tensor field. 

Thus, if X,Y,Z,V,W are coordinate vector fields on a normal neighborhood of p, 
then at the point p 


m(Y,Z,V,W)) —Rm(VxY,Z,V,W) —-:- 
m(Y,Z,V,W)) since VxY(p) =0 
(R(Y,Z)V,W) 
(VxR(Y,Z)V,W) + (R(Y,Z)V,VxW) 

by compatibility of V with the metric 
= (VxVyVzV,W) —(VxVzVyV,W). 


Cyclically permuting X,Y,Z, we get 
(Vy Rm) (Z,X, V, Ww) = (Vy VzVxV, Ww) — (Vy VxVzV, Ww) 


and 
(VzRm)(X,Y, V,W) = (VzVxVyV,W) = (VzVyVxV,W). 


Summing the three equations gives 
(Vx Rm)(Y,Z,V,W + (Vy Rm)(Z,X,V,W) + (VzRm)(X,Y,V,W 
= (R(X,Y)VzV,W) + (R(Y,Z)VxV,W) + (R(Z,X)VyV,W). 
On the right-hand side, because R(—,—)-— is a tensor, we can evaluate the arguments 


at a single point p. Since VxV, VyV, VzV all vanish at p, all three terms are zero. 
This establishes the second Bianchi identity in vector form. 


For the equivalence of the second Bianchi identity (Proposition 22.3) and the 
second Bianchi identity in vector form (Theorem 22.9), see [12, Vol. 1, Theorem 
5.3, p. 155]. 


22.7 Ricci Curvature 


The curvature tensor (R(X,Y)Z,W) is a complicated object, but from it one can 
construct other invariants of a Riemannian manifold M. One such is the sectional 
curvature introduced in Section 12.4. 

Another is the Ricci curvature, which associates to two tangent vectors u,v € 
T,M at p the trace of the linear endomorphism 


wh» R(w,u)v. 


22.9 Defining a Connection Using Connection Matrices 209 


Thus, if X,Y are C~ vector fields on an open set U in M and £},...,E, is an or- 
thonormal frame on U, then 


Ric(X,Y) = tr (R(—,X)Y) 


= S(R(E;,X)Y,E;). 


By the symmetry properties of R, for the Riemannian connection 
Ric(X,Y) = }\(R(E;,X)Y,E;) (by definition) 
= )(R(Y,E))E;,X) (by Theorem 22.6) 
= ¥ (RE, Y)X,E) 
= Ric(Y,X). 


Thus, the Ricci curvature Ric(X, Y) is a symmetric tensor of type (0,2). 


22.8 Scalar Curvature 


At each point p of a Riemannian manifold M, the Ricci curvature is a symmetric 
bilinear form 


Ric : TM x TM — TM. 


The scalar curvature at p is the trace of the Ricci curvature at p, defined as follows. 
By the nondegeneracy of the Riemannian metric ( , ), there is a unique linear 
map p: T,M — T,M such that 


Ric(u,v) = (p(u),v) for allu,v € T,M. 


The scalar curvature S(p) at the point p is defined to be the trace of p. 
If e,,...,@, is an orthonormal basis for 7,,M, then 


p) =trp = ¥\(p(e;),e;) 


22.9 Defining a Connection Using Connection Matrices 


In the next section we will describe how a connection on a vector bundle induces a 
connection on a pullback bundle. The easiest way to do this is to define a connection 
using connection matrices. 

If V is a connection on a vector bundle E — M, then relative to each framed 
open set (U,e1,...,¢,) for E, there is a connection matrix @.. By Theorem 22.1, the 
connection matrix relative to another frame é = ea is related to @, by the formula 
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@: =a '@a+ta ‘da. (22.11) 


For s € T(E) and X € X(M), to define Vys € T(E), it suffices to define (Vxs)|y 
for the open sets U of an open cover of M. By the definition of the restriction of a 
connection, 

(Vxs)|u = Vx (slu)- 
Thus, a connection V on E is completely determined by its restrictions VY to the 
open sets of an open cover for M. Conversely, a collection of connections V” on 
the open sets U of a trivializing open cover of M that agree on pairwise intersections 
UU’ defines a connection on E. 

Equivalently, since a connection on a trivial bundle is completely specified by its 
connection matrix relative to a frame (see Section 11.2), a connection on a vector 
bundle can also be given by a collection of connection forms {@,,(U,e)} satisfying 
the compatibility conditions (22.11). 


22.10 Induced Connection on a Pullback Bundle 


In this section we show that under a C~ map f: N — M, a connection on a vector 
bundle E — M pulls back to a unique connection on f*E — N. 


Theorem 22.10. Let V be a connection on a vector bundle E — M and f: N + M 
a C® map. Suppose M is covered by framed open sets (U,e1,...,e-) for E and the 
connection matrix relative to the frame e is Me. Then there is a unique connection 
on f*E whose connection matrix relative to the frame f*e,,...,f*e, on f—'(U) 
is f* (Qe). 

Proof. The matrix f*(@,-) of 1-forms defines a connection on the trivial bundle 
(f*E)| ¢-1(y) Telative to the frame f*e (Section 11.2). If é = ea is another frame 
on U, then by Theorem 22.1 


Oz = a '@a+a ‘da on U. 
Taking the pullback under f* gives 
f° (we) = (f'a)' f° (@e) fiat (fra) 'df*a on f'(U). 


Since f*é = (f*e)(f*a), the equation above shows that { f*(@,)} satisfies the com- 
patibility condition (22.11) and defines a unique connection on f*E. 


Problems 


22.1. Differential of the inverse of a matrix of functions 
Ifo= [o’] is a matrix of differential forms on a manifold, we define d@ to be the matrix 


whose (i, j)-entry is d (oi). Leta= [a‘] be a matrix of functions on a manifold. Prove that 
d(a—') =—a~!(da)a™!. 


(Note: Because a and da are matrices, we cannot combine the right-hand side into —(da)a~? 
as in calculus.) 
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22.2. Matrix of a linear transformation 

Let L: V + V be a linear transformation of a finite dimensional vector space V. Suppose 
e|,.--,er and @,...,@ are two bases of V such that @; = Leia. If L(e;) = Lier} and 
L(ée) = Dx Ak, prove that A = a~!Aa, where A = [AK and A = [Ai] are the matrices of L 
with respect to the two bases @ and e. 


22.3. Generalized second Bianchi identity 
Prove Proposition 22.4. 


22.4. Product rule for the covariant derivative 
Prove Theorem 22.8. 
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$23 Characteristic Classes 


A connection on a vector bundle E of rank r over a manifold M can be represented lo- 
cally by a matrix @ of 1-forms relative to a frame for E over an open set U. Similarly, 
the curvature of the connection can be represented by a matrix Q of 2-forms over U. 
Under a change of frame @ = ea, the ceva matrix transforms by conjugation 
Q = a~!Qa. Thus, if P(X) is a polynomial in r? variables invariant under conjuga- 
tion by elements of GL(r,R), then the differential form P(Q) will be independent 
of the frame and will define a global form on M. It turns out that this global form 
P(Q) is closed and is independent of the connection. For a fixed vector bundle E, the 
cohomology class [P(Q)] is therefore a well-defined element of H*(M) depending 
only on the invariant polynomial P(X). This gives rise to an algebra homomorphism 


ce: Inv(gl(r,R)) > H*(M), 


called the Chern—Weil homomorphism from the algebra Inv (gl(7,R)) of invariant 
polynomials on gl(r,R) to the de Rham cohomology algebra of M. 

For each homogeneous invariant polynomial P(X) of degree k, the cohomology 
class [P(Q)] € H?*(M) is an isomorphism invariant of the vector bundle: if two 
vector bundles of rank r over M are isomorphic, then their cohomology classes ass- 
ociated to P(X) are equal. In this sense, the class [P(Q)] is characteristic of the 
vector bundle E and it is called a characteristic class of E. A global form P(Q) 
representing a characteristic class is called a characteristic form. 


23.1 Invariant Polynomials on gl(r,R) 


Let X = |x x] be an r x r matrix with indeterminate entries xi . A polynomial P(X) on 
gl(r,R) = R™* is a polynomial in the entries of X. A polvaoinial P(X) on gl(7,R) is 
said to be Ad GL(r, R)-invariant or simply invariant if 


P(A7!XA) = P(X) forall A € GL(r,R). (23.1) 


By Proposition B.1 in Appendix B, if (23.1) holds for all r x r matrices X of 
real numbers, then it holds when X is a matrix of indeterminates, i.e., P(X) is an 
invariant polynomial on gl(r,R). Thus, tr(X) and det(X) are examples of invariant 
polynomials on gl(7,R). 


Example 23.1 (Coefficients of the characteristic polynomial). Let X = [x] be an 
r X r matrix of indeterminates and let A be another indeterminate. The coefficients 


f(X) of A’* in 
det(AI+X)=A'+fi(X)A™ | +5 fa (X)A + f-(X) 


are polynomials on gl(r,IR). For any A € GL(r,R) and any r x r matrix X of real 
numbers, 
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det(AI+A~!XA) = det(AI+X). 


Comparing coefficients of A’~*, we get 
f(A _'XA) = fe(X) for all A € GL(r,R),X € gl(r,R). 


By Proposition B.1, f\(X), ..., f-(X) are all invariant polynomials on gl(r,R). They 
are the coefficients of the characteristic polynomial of —X. (The characteristic poly- 
nomial of X is det(AJ — X).) 


Example 23.2 (Trace polynomials). For any positive integer k, the polynomial ¥,(X) 
:= tr(X*) is clearly invariant under conjugation for any X € gl(r,R). By Proposi- 
tion B.1, ;,(X) is an invariant polynomial on gl(r,IR). It is the kth trace polynomial. 


If A is any commutative R-algebra with identity, then the canonical map R > A 


induces an algebra homomorphism R{[x}] > Alxi]. For any invariant polynomial 


P(X) on gl(r,R) and any invertible matrix A € GL(r,R), the polynomial 
P(X) = P(A~!XA) — P(X) 


is the zero polynomial in Ri). Under the homomorphism R[x] > Alx'], the zero 
polynomial P4(X) maps to the zero polynomial in A[x’;]. Thus, P(X) is identically 
zero when evaluated on A’*”. This means that for any invariant polynomial P(X) on 
gl(7,R) and A € GL(7,R), the invariance condition 


P(A~!XA) = P(X) 


holds not only for r x r matrices X of real numbers, but more generally for r x r 
matrices with entries in any R-algebra A with identity. 


23.2 The Chern—Weil Homomorphism 


Let E — M be a C™ vector bundle over a manifold M. We assume that there is a 
connection V on E; apart from this, there are no restrictions on E or M. Thus, we do 
not assume that FE is a Riemannian bundle or that M is compact or orientable. 

Let Q = [Q‘] be the curvature matrix of V relative to a frame e = [e; --- e,] onU. 
If é= [é --- @,] is another frame on U, then 


é€=ea 


for a C” function a: U -+ GL(r,R) and by Theorem 22.1, the curvature matrix Q 
relative to the frame @ is 


Q=a !Qa. 
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Let P be a homogeneous invariant polynomial of 
degree k on gl(r,R). Fix a point p € U. Let A be the 
commutative R-algebra of covectors of even degrees 
at p, 


A= DBA (TM). 
i=0 


IfQ= [Q'] is the curvature matrix of the connection 
V relative to the frame e for E over U, then Q, € A’ 
and 


Qp =a(p)' Qpa(p), 
where a(p) € GL(7,R). By the invariance of P(X) 
under conjugation, 


Shiing-Shen Chern 
(1911-2004) 


P(Qp) = P(a(p)~!Qpa(p)) = P(Qp) € (TEM). 


As p varies over U, 
Printed with permission of 


May Chou. P(Q) = P(Q)€ O?*(U). (23.2) 


Thus, the 2k-form P(Q) on U is independent of the frame. 

Let {Ug}aca be a trivializing open cover for E. On 
each Ug, choose a frame e% = [ef --- e] for E, and let Qe 
be the curvature matrix of the connection V relative to this 
frame. If P(X) is a homogeneous invariant polynomial of 
degree k on gl(r,R), then P(Qq) is a 2k-form on Ug. On the 
overlap Ug, Ug, we have two 2k-forms P(Qa) and P(Qg). 
By (23.2) they are equal. Therefore, the collection of forms 
{P(Qqa)}aca piece together to give rise to a global 2k-form 
on M, which we denote by P(Q). 

There are two fundamental results concerning the 
form P(Q). 


Theorem 23.3. Let E be a vector bundle of rank r on a man- 
ifold M, V a connection on E, and P an invariant homoge- 
neous polynomial of degree k on gl(r,R). Then (1906-1998) 


(i) the global 2k-form P(Q) on M is closed; 
(ii) the cohomology class of the closed form P(Q) in H7*(M) 
is independent of the connection. 


André Weil 


For any vector bundle E of rank r over M and any connection V on E, the map 
ce: Inv(gl(7,R)) > H*(M), 
P(X) > [P(Q)], 


sends P(X)Q(X) to [P(Q) A Q(Q)] and is clearly an algebra homomorphism. It is 
called the Chern—Weil homomorphism. 
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23.3 Characteristic Forms Are Closed 


In Examples 23.1 and 23.2 we introduced two sets of invariant polynomials on 
gl(r,R): 


(i) the coefficients f,(X) of the characteristic polynomial det(t/ +X), 
(ii) the trace polynomials £;,(X) = tr(X*). 


The polynomials f;,(X) and the trace polynomials X,(X) are related by Newton’s 
identity (Theorem B.14) 


De — file + fode—2 — 2 + (- 1) fe 21 + (— 1k fe = 0. 


These two sets of polynomials play a crucial role in the theory of characteristic 
classes, because of the following algebraic theorem. A proof of this theorem can 
be found in Appendix B. 


Theorem 23.4. The ring Inv (gl(r,R)) of invariant polynomials on gl(r,R) is gener- 
ated as a ring either by the coefficients f,(X) of the characteristic polynomial or by 
the trace polynomials X: 


Inv (gl(7,R)) =R[fi,...,f] =R,...,Z,]. 


In particular, every invariant polynomial P on gl(n,R) is a polynomial Q in 
d1,-..,2,. Thus, 
P(Q) = Q(21(Q),...,2-(Q)). 


This shows that it is enough to prove Theorem 23.3 for the trace polynomials, for if 
the X;,(Q) are closed, then any polynomial in the X;(Q) is closed, and if the coho- 
mology classes of the X,(Q) are independent of the connection V, then the same is 
true of any polynomial in the X;,(Q). 

Recall that if A = [a] and B = [B H| are matrices of C* forms on a manifold with 
the number of columns of A equal to the number of rows of B, then by definition 


() (AAB)) = Tey ABS 
(ii) (dA)i, = d(q). 


The next proposition collects together some basic properties about matrices of dif- 
ferential forms, their wedge product, and their trace. 


Proposition 23.5. Let A = [a] and B = [Bi] be matrices of C® forms of degrees a 


and b, respectively, on a manifold. 


Gi) IfA AB is defined, then 
(AAB)? = (-1)”BT AA’. 


(For matrices of functions, (AB)’ =B’ A"; for smooth forms, a \ B=(—1)” 
B Aa. Formula (i) generalizes both of these formulas.) 
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(ii) IfA \ B and BA are both defined, then 
tr(A A B) = (—1)“ tr(BAA). 
(iii) IFA = [a ‘| is a square matrix of differential forms on M, then 
dtrA =trdA. 


Proof. (i) It suffices to compute the (i, j)-entry of both sides: 


((AAB)")) = (AAB)} = Lay ai, 


(BT AAT), = DB") A (AT) = DBR A of. 
k 


k 


Since or) is an a-form and fly is a b-form, the proposition follows. 


(ii) 
tr(AAB) = DEN) Fa ENE, 
tr(BAA) = By (BAA) -SEi Ae 


(iii) 


dA =d( al) = Dida} = waa. 
i 


i 
Proof (of Theorem 23.3(i) for trace polynomials). Let @ and Q be the connection 
and curvature matrices relative to a frame e on U. Then 


dtr(Q*) = trd(ar*) (Proposition 23.5 (iii)) 
= tr(Q* \@— a@ AQ) (Generalized second Bianchi identity) 
= tr(Q* A @) — tr(@ AQ) 
=0 (Proposition 23.5 (ii)) 


23.4 Differential Forms Depending on a Real Parameter 


If @, is a C™ k-form on a manifold M depending on a real parameter t € J for some 
open interval J C R, then locally, on a chart (U,x!,...,x") for the manifold, @; is a 
linear combination 


0 = Yar(xt) dx, 
I 


where J = (i; <--- < ig) is a multi-index. We say that @, depends smoothly on 
t or varies smoothly with t if for every U in an atlas {(U,x',...,x")} for M, the 
coefficients a;(x,t) are all C” functions on U x J. In this case, we also say that 
{@,}:e) is a smoothly varying family of C* k-forms on M. 
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Suppose {@,};cy is such a family. For each p € M, the map t +> @p is a map 
from J to the vector space Ae M). We define da, /dt, also written @,, to be the 
k-form on M such that at each p € M, 


do) _ 4, 
di Jy ae” 


This definition makes sense because the vector space \(1;M ) is finite-dimensional. 
Locally, 


Similarly, for a < b in J, we define [' i @,; dt to be the k-form on M such that at 


each p € M, 
Ch ord) = [ arpat 
b b 
[oaa-Z([ cult) a (23.3) 
a T a 


All of these notions — C* dependence on ¢ and differentiation and integration with 
respect to tf — extend entry by entry to a matrix of forms depending on a real 
parameter f. 


Locally, 


Proposition 23.6. Suppose @ = @; and T = T are matrices of C™ differential forms 
on a manifold M depending smoothly on a real parameter t. Assume that the wedge 
product @ \ tT makes sense, i.e., the number of columns of @ is equal to the number 
of rows of T. 


(i) Ifo =| AG )] is a square matrix, then 


(ii) (Product rule) 


Ke =@MAT+OAT 
dt " 


(iii) (Commutativity of d/dt with a) “ (dan) = d (Sa). 


(iv) (Commutativity of integration he respect to t with d) If [a,b] C J, then 


b b 
/ aor =a f ovat). 
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Proof. (i) 


d d ; doi (do 
a —(tr@) = a — Ya} (t) = ii =tr (=) : 


The proofs of (ii) and (111) are straightforward. We leave them as exercises (Prob- 
lems 23.1 and 23.2). 

Since both sides of (iv) are matrices of differential forms on M, it suffices to prove 
the equality locally and for each entry. On a coordinate open set (U,x!,...,x”"), 


= Yaj(x,t)dx' and da, = asi nay! 
7 il ax 


By (23.3), 


[aoya=¥([’ sar) dx' \dx!. (23.4) 


a il 


By (23.3) and the definition of the exterior derivative on U, 


af @, dt = za((f cult) dt) ) 
= 35 (" ay( (nat) dx Nx’. (23.5) 


Comparing (23.5) and (23.4), we see that the proposition is equivalent to differ- 
entiation under the integral sign: 


b b 
7) ay(x,t)at= | S alestdt 


By a theorem of real analysis [15, §9.7, Proposition, p. 517], this is always possible 
for an integral over a finite interval [a,b] provided that a; and da;/0x' are continuous 
on U x [a,b]. 


23.5 Independence of Characteristic Classes of a Connection 


Recall from Proposition 10.5 that a convex linear combination of connections on a 

vector bundle is again a connection. Given two connections V° and V! on a smooth 

vector bundle 2: E — M, we define for each t € R the convex linear combination 
Vv! = (1-1)V°+rV!, 


and then V’ is a connection on E. 
Let @, and Q, be the connection and curvature matrices of V‘ relative to a frame 
e1,---,éy for E over an open set U. For X € X(U), 


Vie; = (1—1)Vhe;+tV xe; 
=¥ (1-9 (@)) +1(@1)}) Wer 


L 
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Hence, 

@, = (1—t)@)+ta. 
This proves that the connection matrix @, depends smoothly on t. By the second 
structural equation, the curvature matrix Q, also depends smoothly on f. 

To show that the cohomology class of tr(Q*) is independent of f, it is natural 
to differentiate tr(Qk ) with respect to ¢ and hope that the derivative will be a global 
exact form on M. In the following proof, in order to simplify the notation, we often 
suppress the wedge product. 


Proposition 23.7. If V‘ is a family of connections on a vector bundle E whose con- 
nection and curvature matrices @, and Q, on a framed open set U depend smoothly 
on a real parameter t, then 
d 
7 tr") = d(ktr(Qk'@)), 


where Q = Q,. 
Proof. 
kad (rQ*) =tr (52) (Proposition 23.6 (i)) 
dt dt 
= 1r(QQ*! +QQ0%7 +---+.0*'Q) (product rule) 
= ktr(Qk!Q) (Proposition 23.5 (ii)) 


To show that tr(Q‘~'Q) is an exact form, we differentiate the structural equation 
Q = da@+@ A @ with respect to f: 


. d d 
Q=— a 
Hao t a (or @) 
=d@+0A0+Q0AQ0, 


by the commutativity of d/dt with d (Proposition 23.6(iii)) and the product rule 
(Proposition 23.6(ii)). Hence, 


tr(Q* 1) = (Qk 'do+a*!o@o+Q'! oo) 
= r(Q* do — 02 !o@+0%'o@) 


(by Proposition 23.5(ii), there is a sign change 


because both Q*—! @ and @ have odd degree) 
= tr (Q* da + (dQ*')@) 
(generalized second Bianchi identity) 


= tr (d(Q‘'@)) =d(tr(Q*"'@)) (Proposition 23.5 (iii)). 


Remark 23.8. Under a change of frame é = ea, 


O, = a !@,a+a ‘da. 
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Differentiating with respect to f kills a~'da, since a does not depend on t, hence 
@ =a 'oa. 
Thus, 
tr(Qk'@) = tr(a OQ laa! @a) = tr(a 1a) = r(Q*'@). 


This shows that tr(Q*‘~!@) is independent of the frame and so can be pieced together 
into a global form on M. 


Proof (of Theorem 23.3(ii)). Suppose V° and V! are two connections on the vector 
bundle FE. Define a family of connections 


Vi = (1—2)V°+2V', teR, 
as before, with connection matrix @, and curvature matrix Q, on a framed open set. 
By Proposition 23.7, 
d - 
qt) = d(ktr(Qk!@,)). 
Integrating both sides with respect to ¢ from 0 to | gives 


: d k k | k k 
i (Of) dt = w(O8)] = (Qf) — (Oh) 


and 
1 1 
| aku(a'6))ar=a | ktr(Q* |! @) dt 
0 0 


by Proposition 23.6(iv). 
Thus, 


1 
r(Q’) — tr(Qk) =a | ktr(Q!@) dt 
0 


In this equation, both tr(Q*) and tr(Qk) are global forms on M. As noted in Re- 
mark 23.8, so is tr(Q*!@). Passing to cohomology classes, we get [tr(Q‘)] = 
(tr(Q4)]. This proves that the cohomology class of tr(Q*) is independent of the 
connection. 


For any invariant homogeneous polynomial P(X) of degree k on gl(r,R), since P(Q) 
is a polynomial in tr(Q), tr(Q7), ...,tr(Q*), the characteristic form P(Q) is closed 
and the cohomology class of P(Q) is independent of the connection. 


23.6 Functorial Definition of a Characteristic Class 


A characteristic class on real vector bundles associates to each manifold M a map 


. haunt caer classes of ia +> H*(M). 
vector bundles over M 
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such that if f: N — M is a map of C® manifolds and E — M is a vector bundle over 
M, then 


cy (f*E) = fi cm (E). (23.6) 

Let Vect,(M) be the set of all isomorphism classes of rank k vector bundles 
over M. Both Vect;() and H*( ) are functors on the category of smooth manifolds. In 
functorial language a characteristic class is a natural transformation c: Vect,( ) > 


H*( ). By the definition of a natural transformation, if f: N — M is a C® map of 
manifolds, then there is a commutative diagram 


Vect;(N) N) —“ > H* N) 


ro 


Vect;,(M) ——> H*(M), 


cM 


which is precisely (23.6). For this reason, property (23.6) is often called the natu- 
rality property of the characteristic class c. 


23.7 Naturality 


We will now show that if E — M is a vector bundle of rank r and P is an 
Ad (GL(r,R))-invariant polynomial on gl(r,R), then the cohomology class [P(Q)| 
satisfies the naturality property and therefore defines a characteristic class. 

If V is aconnection on E, with connection matrices @, relative to frames e for E, 
by Theorem 22.10 there is a unique connection V’ on f*E — N whose connection 
matrix relative to the frame f*e is f*(@e). 

The induced curvature form on f*E is therefore 


f*@e+ = sl We, f* We] = f* Qe. 
Since f* is an algebra homomorphism, 
P(F'Qe) = f°P(Qe), 
which proves the naturality of the characteristic class [P(Q)]. 


Problems 


23.1. Product rule for forms depending on ¢ 


(a) Suppose @ = @; and Tt = 7 are C™ differential forms on M depending smoothly on a real 
parameter t. Denote the derivative of @ by dw/dt or @. Prove the product rule 


d : : 
OAT) = OATH OAT. 
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(b) Prove the same formula if @ and T are matrices of k-forms and ¢-forms on M depending 
smoothly on ¢. 

(c) If a, B, y are matrices of differential forms on M depending smoothly on t € J for which 
a/ B A is defined, prove that 


“(aBy) = aBy + oBy+ aBy. 


23.2. Commutativity of d/dt with the exterior derivative 
Suppose @, is a C® k-form on a manifold M depending smoothly on ¢ in some open interval 
J CR. Prove that 

d 


dm) = d(@). 


(Hint: Write out q@, in terms of local coordinates x!,...,x” near D-) 
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$24 Pontrjagin Classes 


Suppose F' is a smooth vector bundle of rank 
r over a manifold M. In the preceding section 
we showed that starting with any connection 
V on E, if P(X) is a homogeneous invari- 
ant polynomial of degree k on gl(r,R) and Q 
is the curvature matrix of V relative to any 
frame for E, then P(Q) defines a closed global 
2k-form on M, whose cohomology class is in- 
dependent of the connection. The cohomol- 
ogy class [P(Q)] is called the characteristic 
class associated to P of the vector bundle E. 
This construction gives rise to the Chern—Weil 
homomorphism, 


ce: Inv (gl(r,R)) > H*(M) 


from the algebra of invariant polynomials on 
gl(7,R) to the de Rham cohomology algebra 
of M. The image of the Chern—Weil homo- 
morphism is precisely the subalgebra of char- 
acteristic classes of E associated to invariant 
polynomials. 

Since the algebra Inv (gl(r,R)) of invari- 
ant polynomials on gl(r, IR) is generated by the 


Lev Pontrjagin 


(1908-1988) 


coefficients f,(X) of the characteristic polynomials and by the trace polynomials 
tr(X k ) (Appendix B, Theorem B.17), to determine all the characteristic classes of E, 
it suffices to calculate the characteristic classes associated to either set of polynomi- 
als. The characteristic classes arising from the f;,(X) are called Pontrjagin classes. 


24.1 Vanishing of Characteristic Classes 


In Section 11.4, we learned that on a Riemannian bundle FE over a manifold M the 
curvature matrix of a metric connection relative to an orthonormal frame is skew- 


symmetric: 


0 Qh... 
-a} Oo See 


_Q! 


From the skew-symmetry of the curvature matrix, we see that f;(Q) = tr(Q) = 0. 
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This example is an instance of a general phenomenon: if k is odd, then the coho- 
mology class [tr(Q*)] = 0 for any connection on any vector bundle. To see this, we 
begin with two simple algebraic lemmas. 


Lemma 24.1. [f A is a skew-symmetric matrix of 2-forms on a manifold, then Ak is 
symmetric for k even and skew-symmetric for k odd. 


Proof. By Proposition 23.5(i), 
(AAA)? =A’ AAT = (—A)A(-A) =AAA 


and 


((AAA) AA)! =A? A(A? AA’) = (—A) A(—A) A (—A) = —AAAAA. 


Hence, A? is symmetric and A* is skew-symmetric. The general case proceeds by 
induction. 


Lemma 24.2. Suppose 2 is not a zero divisor in a ring R. Then the diagonal entries 
of a skew-symmetric matrix over R are all zero. 


Proof. A skew-symmetric matrix is a square matrix A = [a'] such that A? = —A. 
This implies that ai = —di. Hence, 2a' = 0. Since 2 is not a zero divisor in R, one 
has a; = 0 for all i. 


Next, we recall a few facts about Riemannian metrics, connections, and curvature 
on a vector bundle. 


(i) It is possible to construct a Riemannian metric on any smooth vector bundle 
(Theorem 10.8). 
(ii) On any Riemannian bundle there is a metric connection, a connection compati- 
ble with the metric (Proposition 10.12). 
(iii) Relative to an orthonormal frame, the curvature matrix Q of a metric connection 
is skew-symmetric (Propositions 11.4 and 11.5). 


By Lemma 24.1, if Q is the curvature matrix of a metric connection relative to 
an orthonormal frame, then for an odd k the matrix Q* is skew-symmetric. Since the 
diagonal entries of a skew-symmetric matrix are all zero, tr(Q*) = 0 for all odd k. 

Even if the frame é is not orthonormal, the same result holds for the curvature 
matrix of a metric connection. This is because the curvature matrix relative to @ is 
Q = a~!Qa, where a is the change of basis matrix defined by @ = ea. Since the trace 
polynomial tr(X*) is invariant under conjugation, if Q is the curvature matrix relative 
to an orthonormal frame e, then 


tr(Q*) = tr(a!OKa) = tr(Q*) = 0 for k odd. 


If the connection V on E is not compatible with a metric on E, then tr(Q*) need 
not be zero. However, by Theorem 23.3(ii) the cohomology class [tr(Q*)] is still 
zero, because the cohomology class is independent of the connection. 
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Example. Let E be any smooth vector bundle over M. Put a Riemannian metric on E 
and let V be a metric connection on E, with curvature matrix Q on a framed open set. 
Suppose P(X) is a homogeneous invariant polynomial of degree 3 on gl(r,IR). Since 
the trace polynomials X), Xo, X3,... generate the ring of invariant polynomials, P is 
a linear combination of monomials of degree 3 in the trace polynomials. There are 
only three such monomials, bye X25 and X&3. Hence, 


P(X) = ad, (X)? + bd) (X)Z2(X) +23(X) 


for some constants a,b,c € R. Since the odd trace polynomials Z; (Q) and X3(Q) are 
both zero, P(Q) = 0. Thus, the characteristic class of a vector bundle associated to 
any homogeneous invariant polynomial of degree 3 is zero. 


Clearly, there is nothing special about degree 3 except that it is odd. 


Theorem 24.3. [f a homogeneous invariant polynomial P(X) on gl(r,R) has odd 
degree k, then for any connection V on any vector bundle E — M with curvature 
matrix Q, the cohomology class [P(Q)] is zero in H**(M). 


Proof. Put a Riemannian metric on E and let V' be a metric connection on E, with 
curvature matrix Q' on a framed open set. By Theorem 23.4, P(X) is a linear combi- 
nation with real coefficients of monomials Z}' -- -L'r in the trace polynomials. Since 
k is odd, every monomial term of P(X) has odd degree and so must contain a trace 
polynomial 2; of odd degree j as a factor. The vanishing of the odd trace polynomi- 
als Z;(Q) implies that P(Q’) = 0. By Theorem 23.3(ii), if V is any connection on E 
with curvature matrix Q, then [P(Q)] = [P(Q’)] = 0. 


24.2 Pontrjagin Classes 


To calculate the characteristic classes of a vector bundle E — M, we can always use 
a metric connection V compatible with some Riemannian metric on E. Let Q be the 
curvature matrix of such a connection V on a framed open set. From the proof of 
Theorem 24.3, we see that for k odd, the polynomials tr(Q*) and f;(Q) are all zero. 
Thus, the ring of characteristic classes of FE has two sets of generators: 


(i) the trace polynomials of even degrees 
[tr(27)], [tr(Q*)}, [tr(Q°)),... 


and 
(ii) the coefficients of even degrees of the characteristic polynomial det(AJ + Q) 


[f2(Q)], [fa(Q)], [fo(Q)],---- 


Definition 24.4. The kth Pontrjagin class p,;(E) of a vector bundle E over M is 
defined to be 
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i 
px(E) = iB (-2)| € H**(M). (24.1) 


In this definition the factor of i is to ensure that other formulas will be sign-free. 
Since fo; is homogeneous of degree 2k, the purely imaginary number i disappears in 
fox ((i/27)Q). A closed differential g-form on M is said to be integral if it gives an 
integer when integrated over any compact oriented submanifold of dimension g of 
M. The factor of 1/27 in the definition of the Pontrjagin class in (24.1) ensures that 
px(E) is the class of an integral form. If the rank of E is r, then Q is an r x r matrix 
of 2-forms. For any real number x € R, let |x| be the greatest integer < x. Then one 
can collect together all the Pontrjagin classes in a single formula: 


4 o)\-ars ge (to\art AG ede (2 
det (ar sa) =A'+fi (5-2) M+ hf2 (5-2) ATS +++ f, (5-2) 
SAT + pia? $e + pig ar 78), 


where r is the rank of the vector bundle E. Setting A = 1, we get 
i 
det (14 22) = a os Me aeeee 4p 


We call this expression the total Pontrjagin class of E and denote it by p(E). Note 
that pe = fax (32) € H**(M). 
Let E be a vector bundle of rank r over a compact oriented manifold M of di- 


mension 4m. A monomial pi! p5?-:- pie i of weighted degree 


r 


4 (a1 +2a+---+|5 | a15)) =4m 


represents a cohomology class of degree 4m on M and can be integrated over M; the 
resulting number fy, yO oN vee Pinal is called a Pontrjagin number of E. The Pon- 
trjagin numbers of a manifold M of dimension 4m are defined to be the Pontrjagin 


numbers of its tangent bundle TM. 


Example 24.5. For a compact oriented manifold M of dimension 4, the only Pontrja- 
gin number is fy, p1. For a compact oriented manifold of dimension 8, there are two 
Pontrjagin numbers Jy, pj and fy p2- 


24.3 The Whitney Product Formula 
The total Pontrjagin class of a direct sum of vector bundles can be easily computed 
using the Whitney product formula. 


Theorem 24.6 (Whitney product formula). /f E’ and E" are vector bundles over 
M, then 


P(E’ ®E") = p{E')p(E"). 
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Proof. Let V’ and V” be connections on the vector bundles E’ and E", respectively. 
We define a connection V := V’@ V” on E := E’ GE" by 


/ Pat 
Vx M = Eel for X € ¥(M),s' €T(E’),s" €T(E"). 
Ss Vs 


and 


Vx | = (Xf ) H +f Vx y : 


Therefore, V is a connection on E. 

Let e’ = [e| --- e/.] and e” = [e/ --- ef] be frames 
for E’ and E”, respectively, over an open set U. Then 
e= le’ e”| isa frame of E’@ E” over U. Suppose the 
connection matrices of V’ and V” relative to e’ and 
e” are w' and w” and the curvature matrices are Q! 
and Q", respectively. Then the connection matrix of 


V relative to the frame e is 


Hassler Whitney 
wo’ 0 
o= i ra (1907-1989) 


and the curvature matrix is 
d 
Q=do+o\o0= 
Q' 0 
= 0 Q"\" 
Therefore, the total Pontrjagin class of E’ @ E” is 
ji Wy i [Q' 0 


_ 1+3Q/ 0 
=aet| 0 1+ ta" 


— det (74+ a’ ) det (7+ 0" 
20 20 


= p(E')p(E"). 7 
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For dimension reasons alone, we cannot expect the Pontrjagin classes to give a gener- 
alized Gauss—Bonnet theorem. A Pontrjagin number makes sense only on a compact 
oriented manifold of dimension 4m, whereas a generalized Gauss—Bonnet theorem 
should hold for a compact oriented manifold of any even dimension. 

We used a Riemannian structure on a vector bundle E to show that the char- 
acteristic classes defined by Ad (GL(r, R))-invariant polynomials of odd degree all 
vanish, and we used the orientability of the manifold M to integrate a top form, but 
we have not used the orientability assumption on the bundle E. This is the final 
ingredient for a generalized Gauss—Bonnet theorem. 

A C® complex vector bundle is a smooth map m: E — M of manifolds that is 
locally of the form U x C’ + U. Just as complex numbers are necessary in the theory 
of polynomial equations, so complex vector bundles play an essential role in many 
areas of geometry. Examples abound: the tangent bundle of a complex manifold, 
the tautological subbundle on a complex Grassmannian, the hyperplane bundle of a 
complex projective variety. 

Most of the concepts we developed for real vector bundles have obvious counter- 
parts for complex vector bundles. In particular, the analogues of Pontrjagin classes 
for complex vector bundles are the Chern classes. 

In this section we define the Euler class of a real oriented vector bundle of even 
rank and the Chern classes of a complex vector bundle. 


25.1 Orientation on a Vector Bundle 


Recall that an orientation on a vector space V of dimension r is an equivalent class 
of ordered bases, two ordered bases [u; ... u;] and [v; ... v,] being equivalent if and 
only if they are related to each other by multiplication by a nonsingular r x r matrix 
of positive determinant: 


[vy eae vy] — [uy tee u,lA, detA > 0. 


Thus, every vector space has exactly two orientations. 
Each ordered basis v;,...,v, determines a nonzero element v; \---/A v; in the rth 
exterior power /A\’ V (Lemma 19.7). If 


[vy ee v;] = [uy eee u,lA, 
then by the same calculation as in Proposition 16.8, 
vp A+++ Av, = (detA) uy A+++ A uy. 


Thus, an orientation on V can also be represented by a nonzero element in \’V ~ R, 
with two nonzero elements representing the same orientation if and only if they are 
positive multiples of each other. 
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This suggests how to define orientation on a vector bundle. 


Definition 25.1. An orientation on a vector bundle E — M of rank r is an equiv- 
alence class of nowhere-vanishing C® sections of the line bundle A’ £, two such 
sections s,s’ being equivalent if and only if they are related by multiplication by a 
positive function on M: s’ = fs, f > 0. 


Intuitively this means that each fiber E, has an orientation which varies with 
p€M inaC way. By Problem 25.1, a line bundle has a nowhere-vanishing C™ 
section if and only if it is a trivial bundle, so we have the following proposition. 


Proposition 25.2. A vector bundle E of rank r has an orientation if and only if the 
line bundle /\’ E is trivial. 


If a vector bundle has an orientation, we say that it is orientable. On a connected 
manifold M an orientable vector bundle E of rank r has exactly two orientations, 
corresponding to the two connected components of A’ E. An orientable vector bun- 
dle together with a choice of orientation is said to be oriented. In this language, a 
manifold is orientable if and only if its tangent bundle is an orientable vector bundle. 


25.2 Characteristic Classes of an Oriented Vector Bundle 


Let E — M be an oriented Riemannian bundle of rank r, and V a connection on E 
compatible with the metric. A frame e = [e; --- e,] for E over an open set U is said to 
be positively oriented if at each point p in M it agrees with the orientation on E. We 
agree to use only positively oriented orthonormal frames to compute the connection 
matrix @ and the curvature matrix Q of V. By Propositions 11.4 and 11.5, relative to 
the orthonormal frame e, both @ and Q are skew-symmetric. 

If é is another positively oriented frame for E over U, then 


é€é=ea (25.1) 


for a C* function a: U + SO(r). Let Q be the curvature matrix relative to the 
frame @. Under the change of frame (25.1) the curvature matrix Q transforms as in 
Theorem 22. 1 (ii): 

Q=a'Qa, 


but now a is a special orthogonal matrix at each point. 

Hence, to get a global form P(Q) on M, we do not need the polynomial P(X) to 
be invariant under conjugation by all elements of the general linear group GL(7,R), 
but only by elements of the special orthogonal group SO(r). A polynomial P(X) for 
X € R’™" is Ad (SO(r))-invariant if 


P(A~!XA) = P(X) for all A € SO(r). 


Of course, every Ad (GL(r,R))-invariant polynomial is also Ad (SO(r)) -invariant, 
but might there be an Ad (SO(r))-invariant polynomial that is not Ad (GL(r,R))- 
invariant? Such a polynomial P would give us a new characteristic class. Note that 
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for an oriented Riemannian bundle, the curvature matrix Q relative to an orthonormal 
frame is skew-symmetric, so the polynomial P need not be defined on all of gl(7,R), 
but only on the subspace of skew-symmetric matrices. The space of all r x r skew- 
symmetric matrices is the Lie algebra of SO(r), denoted by so(r). Let X = [x‘] be 
an r x r skew-symmetric matrix of indeterminates. Thus, a = —xi forl<i<j<r, 
and x! = 0 for all i. A polynomial on so(r) is an element of Ri], where the Ms l< 
i,j <r, are indeterminates satisfying the skew-symmetry condition. Alternatively, 
we may view a polynomial on so(r) as an element of the free polynomial ring Rix], 
1<i<j<vr, where the xi are algebraically independent elements over R. We now 
have a purely algebraic problem: 

Problem. What are the Ad (SO(r))-invariant polynomials on so(r), the r x r skew- 
symmetric matrices? 


We already know some of these, namely the Ad (GL(r, R)) -invariant polynomi- 
als, which are generated as a ring either by the trace polynomials or by the coe- 
fficients of the characteristic polynomial. It turns out that for r odd, these are the 
only Ad (SO(r)) -invariant polynomials. For r even, the ring of Ad (SO(r))-invariant 
polynomials has an additional generator called the Pfaffian, which is a square root 
of the determinant. 


25.3 The Pfaffian of a Skew-Symmetric Matrix 


Two r x r matrices X and Y with entries in a field F 
are said to be congruent if there is an r x r nonsin- 
gular matrix A € GL(r,F) such that Y = A?XA. Bya 
theorem in linear algebra [13, Th. 8.1, p. 586], every 
skew-symmetric matrix X over any field F is congru- 
ent to a matrix of the form: 


S 


ATXA= 


0 


Johann Friedrich Pfaff 
(1765-1825) 


where S is the 2 x 2 skew-symmetric matrix 


01 
s=| oa) 
Since A is nonsingular, it follows that a nonsingular skew-symmetric matrix X has 
even rank. 
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If X is nonsingular, then 
ATXA= 


Moreover, for such a matrix X, 


det(A7 XA) = (detA)?(detX) = (det S)---(detS) = 1. 


1 \2 
detX = | —— ] €F 
detA 


Thus, the determinant of a skew-symmetric matrix is a perfect square in the field. In 
fact, more is true. 


So 


Theorem 25.3. Let X = [x‘] be a2m X 2m skew-symmetric matrix of indeterminates. 
Then det(X) is a perfect square in the polynomial ring Z{x\)]. 


Proof. Let Q(x}) be the field generated over Q by the indeterminates x, We know 
that detX #0 in Z\x'], because there exist values of x’, in Z for which detX #0. 
Thus, X = |x')] is a 2m x 2m nonsingular skew-symmetric matrix over the field Q(x;). 
By the discussion above, there is a matrix A € GL (2m, Q(x;)) such that 


i? 
detX = (aaa) € QU). 


This proves that detX is a perfect square in Q(x), the fraction field of the integral 


domain Z{x'). Suppose 
p(x) \* 
detX = ( ) ; 
q 
where p(x), q(x) € Z[x'] are relatively prime in Z{x'}. Then 
q(x)? detX = p(x)? € Z[x']. 
Since Z{x'] is a unique factorization domain, g(x) must divide p(x), but g(x) and 


p(x) are relatively prime, so the only possibilities are q(x) = +1. It follows that 
detX = p(x)* in VANE 


The Pfaffian of a 2m x 2m skew-symmetric matrix X of indeterminates is defined 
to be a polynomial Pf(X) such that 


det(X) = (Pf(X))*; 
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however, since any perfect square in a ring has two square roots (as long as the char- 
acteristic of the ring is not 2), a normalization condition is necessary to determine the 
sign of Pf(X). We adopt the convention that on the standard skew-symmetric matrix 


S 
01 
Jom = me, ’ s=|_f ae 


we have Pf(J2»,) = 1. The Pfaffian Pf(X) is a polynomial of degree m on s0(2m). 


_[ 0% 
«=| 43] 
then det(X) = (x4)? and Pf(X) = +x}. Since Pf(S) = 1, the sign must be positive. 
Thus, PFC) = 24. 


Example 25.4. If 


Proposition 25.5. Let A = [a'] and X = [x4] be 2m x 2m matrices of indeterminates 


with X skew-symmetric. Then 
Pf(A7 XA) = det(A) Pf(X) € Z{a’,,x']. 
Proof. By the properties of the determinant (Example B.3), 
det(A7 XA) = (detA)? det X. (25.2) 
Since A’ XA and X are both skew-symmetric, (25.2) is equivalent to 
(Pf(A7XA))” = (detA)?(PFX)’. 


Therefore, 


Pf(A7XA) = +det(A) Pf(X). (25.3) 


Since Pf(A7 XA) and det(A) Pf(X) are two uniquely defined elements of the ring 
Z{a'.,x'], the sign does not depend on A or X. To determine the sign, it suffices to 
evaluate (25.3) on some particular A and X with integer entries. Taking A = hm = 
the 2m x 2m identity matrix and X = Jz, we get 


Pf(A XA) = Pf(J2) = 1 = det(A) Pf(X). 


Therefore, the sign is + and 


Pf(A’ XA) = det(A) Pf(X). 


From the proposition, it follows that if X € so(r) and A € SO(r), then 
Pf(A~!XA) = Pf(A? XA) = det(A) Pf(X) = Pf(X). 


Hence, the Pfaffian Pf(X) is an Ad (SO(r))-invariant polynomial on r x r skew- 
symmetric matrices. 
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25.4 The Euler Class 


If a € SO(2m), then a~! = a? and deta = 1. Coming back to the curvature matrix 
Q relative to a positively oriented orthonormal frame e of an oriented Riemannian 
bundle E — M over an open set U, we see that if Q is the curvature matrix relative to 
another positively oriented orthonormal frame é for E|y, then by Theorem 22.1 (ii) 
the Pfaffian satisfies 


Pf(Q) = Pf(a~!Qa) = Pf(a’ Qa) 
= det(a) Pf(Q) = Pf(Q). 


So Pf(Q) is independent of the positively oriented orthonormal frame e. As before, 
Pf(Q) defines a global form on M. One can prove that this global form is closed 
and that its cohomology class [Pf(Q)] € H?”(M) is independent of the connection 
[17, pp. 310-311]. Instead of doing this, we will give an alternate construction in 
Section 32 valid for any Lie group. The class e(E) := [Pf((1/27)Q)] is called the 
Euler class of the oriented Riemannian bundle E. 


25.5 Generalized Gauss—Bonnet Theorem 


We can finally state, though not prove, a generalization of the Gauss—Bonnet theorem 
to higher dimensions. 


Theorem 25.6. ({19, Vol. 5, Ch. 13, Th. 26, p.404], [17, p. 311]) Let M be a com- 
pact oriented Riemannian manifold M of dimension 2m, and V a metric connection 
on its tangent bundle TM with curvature matrix Q relative to a positively oriented 


orthonormal frame. Then 
1 
Pf{| —Q)=y7(M). 
[pe( 9) =x 


Example 25.7. When M is a compact oriented surface and e1,e2 is a positively ori- 
ented orthonormal frame on M, we found the connection and curvature matrices 
to be 


1 
o-| al Q=do+0ho=do=| 


0 da} 
—a1 0 


—do} 0 


(see Section 12.1.) By Theorem 12.3, do} = K@! A 0”, where K is the Gaussian 
curvature and @!, 9” the dual frame to e1,e2. So 


Pf(Q) = da} = Ko! Ae. 


From the generalized Gauss—Bonnet theorem, we get 


[erm = [ er(£0) = [ Kone = xm), 


which is the classical Gauss—Bonnet theorem for a compact oriented Riemannian 
2-manifold. 
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25.6 Hermitian Metrics 


On the complex vector space C”, the Hermitian inner product is given by 
n 
(z,w) = ¥ zw; forz,weC". 
j=l 


Extracting the properties of the Hermitian inner product, we obtain the definition of 
a complex inner product on a complex vector space. 


Definition 25.8. A complex inner product on a complex vector space V is a 
positive-definite, Hermitian-symmetric, sesquilinear form ( , ): Vx V + C. This 
means that for u,v,w € V anda,b€ C, 


(i) (positive-definiteness) (v, v) > 0; the equality holds if and only if v = 0. 
(ii) (Hermitian symmetry) (u,v) = (v,u). 
(iii) (sesquilinearity) ( , ) is linear in the first argument and conjugate linear in the 
second argument: 


(au+ bv,w) = a(u,w) +b(v,w), 
(w,au+ bv) = a(w,u) + b(w,v). 


In fact, for sesquilinearity it is enough to check linearity in the first argument. 
Conjugate linearity in the second argument follows from linearity in the first argu- 
ment and Hermitian symmetry. 

A Hermitian metric on a C~ complex vector bundle E — M assigns to each 
p © M acomplex inner product ( , ), on the fiber E,; the assignment is required 
to be C® in the following sense: if s and ¢ are C® sections of E, then (s,t) is a C® 
complex-valued function on M. A complex vector bundle on which there is given a 
Hermitian metric is called a Hermitian bundle. 

It is easy to check that a finite positive linear combination ,a;( , ); of complex 
inner products on a complex vector space V is again a complex inner product. A 
partition of unity argument as in Theorem 10.8 then proves the existence of a C™ 
Hermitian metric on any C” complex vector bundle. 


25.7 Connections and Curvature on a Complex Vector Bundle 


A connection on a complex vector bundle E — M is defined in exactly the same 
way as a connection on a real vector bundle (Definition 10.1): it is an R-bilinear 
map V: X(M) x T(E) + T(E) that is F-linear in the first argument, C-linear in 
the second argument, and satisfies the Leibniz rule in the second argument. Just 
as in Theorem 10.6, every C* complex vector bundle E over a manifold M has a 
connection. 

On a Hermitian bundle E + M, a connection V is called a metric connection 
and is said to be compatible with the Hermitian metric if for all X € X(M) and 
s,t E T(E), 

X(s,t) = (Vx5,t) + (s, Vxt). 
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Relative to a frame e;,...,e, over U for a complex vector bundle E, a connection 
V can be represented by a matrix [oi] of complex-valued 1-forms: 


‘ 
Vxej= > W(X )ei. 
i=1 


This is just like the real case except that now OF are complex-valued 1-forms on U. 
For X,Y € X(M), the curvature R(X,Y): T(E) > T(E) of the connection is defined 
as before: 


R(X,Y) = VyVy —VyVx Vix.y]- 


Relative to a frame e),...,e, for E over U, curvature is represented by a matrix [Q5] 
of complex-valued 2-forms. We again have the structural equation 


Q=doa+aoara. 


25.8 Chern Classes 


Under a change of frame é = ea, where a € GL(r,C), the connection and curvature 
matrices transform by the formulas (Theorem 22.1) 


O= a '@a+a ‘da, 

Q =a 'Qa. 
Thus, a complex polynomial Q on gl(r,C) that is invariant under conjugation by 
elements of GL(r7,C) will define a global form Q(Q) on M. As before, one shows 


that [Q(Q)] is closed and that the cohomology class [Q(Q)] is independent of the 
connection. Taking Q(Q) = det(J+ 5-Q), we obtain the Chern classes c;(E) of E 


from 
V-1 
det (14 2) =1+c)(E)+---+c¢,(E). 


The same argument as in Section 24 proves that Chern classes satisfy the natu- 
rality property and the Whitney product formula. 


Problems 


25.1. Trivial line bundle 
A line bundle is a vector bundle of rank 1. Prove that if a C® line bundle E + M has a 
nowhere-vanishing C™ section, then it is a trivial bundle. 


25.2. The determinant and the Pfaffian 


For 
0 abe 
—a 0 dee 
- —b-d 0 f}’ 
—c-e-f 0 


find det(X) and Pf(X). 


25.3. Odd-dimensional skew-symmetric matrices 
Using the definition of a skew-symmetric matrix A? = —A, prove that for r odd, an rx r 
skew-symmetric matrix is necessarily singular. 
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The Pontrjagin classes, the Euler class, and the Chern classes are collectively called 
characteristic classes of a real or complex vector bundle. They are fundamental 
diffeomorphism invariants of a vector bundle. Applied to the tangent bundle of a 
manifold, they yield diffeomorphism invariants of a manifold. We now give a sam- 
pling, without proof, of their applications in topology and geometry. 


26.1 The Generalized Gauss—Bonnet Theorem 


For a compact, oriented Riemannian manifold M of even dimension 2n, the Euler 
class e(TM) of the tangent bundle is represented by a closed 2n-form in the entries 
of the curvature matrix of M. As we said earlier, the generalized Gauss—Bonnet 
theorem states that [17, p. 311] 


| e(TM) = 7(M). 
M 


26.2 Characteristic Numbers 


On a compact oriented manifold M of dimension 4n, a monomial pi --- pin has total 
degree 
4i; + 8i2 +-+-+4nin. 


If the total degree happens to be 4n, then the integral fy, Py --- pin is defined. It 
is called a Pontrjagin number of M. The Pontrjagin numbers are diffeomorphism 
invariants of the manifold M; in fact, they turn out to be topological invariants as 
well. 

The complex analogue of a smooth manifold is a complex manifold. It is 
a second countable, Hausdorff topological space that is locally homeomorphic 
to an open subset of C”; moreover, if da: Ua + ba(Ua) CC" and og: Ug > 
op (Ug) C C” are two such homeomorphisms, then the transition function $g © 
Ga! : Ga(UaNUg) — og (UaN Us) is required to be holomorphic. If (U,z',...,z”) is 
a chart of the complex manifold, then the holomorphic tangent space 7;,,M has basis 
(0/dz')p,...,(0/z")p. Just as in the real case, the disjoint union [],<y TpM has 
the structure of a complex vector bundle over M, called the holomorphic tangent 
bundle. Let c,,...,€, be the Chern classes of the holomorphic tangent bundle of a 
complex manifold of complex dimension n. The degree of the monomial c’! s+ cln 
is 2i; + ---+ 2ni,. If this degree happens to be the same as the real dimension 2n 
of the manifold, then the integral [y,c/! -- -cin is defined and is called a Chern num- 
ber of M. The Chern numbers are diffeomorphism invariants and also topological 
invariants of a complex manifold. 
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26.3 The Cobordism Problem 


Two oriented manifolds M, and M are said to be cobordant if there exists an oriented 
manifold N with boundary such that 


ON =M,—M), 


where —M) denotes M) with the opposite orientation. An oriented manifold is cobor- 
dant to the empty set if it is the boundary of another oriented manifold. The Pon- 
trjagin numbers give a necessary condition for a compact oriented manifold to be 
cobordant to the empty set. 


Theorem 26.1. [f a compact oriented manifold M of dimension 4n is cobordant to 
the empty set, then all the Pontrjagin numbers of M vanish. 


Proof. Suppose M = ON for some manifold N with boundary and that pi --+ pin has 
degree 4n. Then 


[vt oie = | pil --+ pin 
Mo Jaye Be 


=| d(p'} ---pin) (by Stokes’ theorem) 
N 
=0 


since Pontrjagin classes are represented by closed forms and a product of closed 
forms is closed. 


Corollary 26.2. [f two compact oriented manifolds M, and M) are cobordant, then 
their respective Pontrjagin numbers are equal. 


Proof. If M, — Mz = ON, then Ju, _Mp pi --- pin = 0 by Theorem 26.1. Hence, 


iy in _ iy i 
pipe =f Pio Ph- 
M M2 


26.4 The Embedding Problem 


According to Whitney’s embedding theorem, a smooth manifold of dimension n 
can be embedded in R7”*!. Could one improve on the dimension? In some cases 
Pontrjagin classes give a necessary condition. 


Theorem 26.3. [f a compact oriented manifold M of dimension 4n can be embedded 
as a hypersurface in R*"*", then all the Pontrjagin classes of M vanish. 


Proof. Since M is compact oriented in R*"*!, it has an outward unit normal vector 
field and so its normal bundle N in R*”*! is trivial. From the short exact sequence of 
C™ bundles, 

03 TM > TR*"*||y > N 0, 
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we obtain a direct sum 
TR ly ~TMON. 


By the Whitney product formula for Pontrjagin classes, 


1 = p(TR**" yy) = p(TM)p(N) = p(TM). 


Thus, all the Pontrjagin classes of M are trivial. 


26.5 The Hirzebruch Signature Formula 


On a 4n-dimensional compact oriented manifold M, the intersection of 2n-dimensional 
cycles defines a symmetric bilinear pairing 


(,): H?"(M;R) x H?"(M;R) > H*"(M;R) =R, 


called the intersection form. The intersection form is represented by a symmetric 
matrix, which has real eigenvalues. If b* is the number of positive eigenvalues and 
b~ is the number of negative eigenvalues of the symmetric matrix, then the difference 
o(M) := b* —b~ is called the signature of M. Clearly, it is a topological invariant 
of the 4n-dimensional manifold. 

In 1953 Hirzebruch found a formula for the signature in terms of Pontrjagin 
classes. 


Theorem 26.4 (Hirzebruch signature formula). The signature of a compact, 
smooth, oriented 4n-dimensional manifold is given by 


o(M) = Jf talPis---sPa) 


where the Ly’s are the L-polynomials defined in Appendix B. 


26.6 The Riemann—Roch Problem 


Many problems in complex analysis and algebraic geometry can be formulated in 
terms of sections of vector bundles. A holomorphic vector bundle is a holomorphic 
map 7: E + M of complex manifolds that is locally a product U x C” such that 
the transition functions are holomorphic. If EF is a holomorphic vector bundle over 
a complex manifold M, then the vector space of holomorphic sections of E can be 
identified with the zeroth cohomology group H°(M;E) with coefficients in the sheaf 
of holomorphic sections of E. When M is compact, this vector space turns out to be 
finite-dimensional, but its dimension is in general not so easy to compute. Instead, 
the alternating sum 
n 
4(M;E) = ¥ (-1)!dimH4(M;E), n=dimcM, 
g=0 


is more amenable to computation. 
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Let L be a complex line bundle over a compact Riemann surface M. The degree 
of L is defined to be fy, c1(L), where c;(L) is the first Chern class of L. The classical 
Riemann-Roch theorem states that for a holomorphic line bundle L over a compact 
Riemann surface M of genus g, 


y(M;L) =deglL—g+l. 


The HIrzebruch—-Riemann—Roch theorem, discovered in 1954, is a generalization to 
a holomorphic vector bundle E over a compact complex manifold M of complex 
dimension n 


(ME) = I, [ch(E)td(TM)]p, (26.1) 
where the Chern character 
ch(E) = ch(ci(E),...,¢r(E)) 
and the Todd polynomial 
td(TM) = td(c)(M),...,¢n(M)) 


are defined as in Appendix B and [ ],, means the component of degree n. 

In some cases one has vanishing theorems H4(M; EF) = 0 for higher cohomology 
q > 1. When this happens, dimH°(M;E) = 7(M;E), which can be computed from 
the Hirzebruch—-Riemann—Roch theorem. 


Chapter 6 


Principal Bundles and Characteristic Classes 


A principal bundle is a locally trivial family of groups. It turns out that the theory of 
connections on a vector bundle can be subsumed under the theory of connections on 
a principal bundle. The latter, moreover, has the advantage that its connection forms 
are basis-free. 

In this chapter we will first give several equivalent constructions of a connection 
on a principal bundle, and then generalize the notion curvature to a principal bun- 
dle, paving the way to a generalization of characteristic classes to principal bundles. 
Along the way, we also generalize covariant derivatives to principal bundles. 


$27 Principal Bundles 


We saw in Section |1 that a connection V on a vector bundle F over a manifold M 
can be represented by a matrix of 1-forms over a framed open set. For any frame 
e = |e, --: e,| for E over an open set U, the connection matrix @, relative to e is 
defined by 

Vxej= Yi (@.); (X)ei 


L 


for all C® vector fields X over U. If = [@, --- @,] = ea is another frame for E over U, 
where a: U + GL(r,R) is a matrix of C® transition functions, then by Theorem 22.1 
the connection matrix @, transforms according to the rule 


Oz = a'@a+ta'da. 


Associated to a vector bundle is an object called its frame bundle 1: Fr(E) > M; 
the total space Fr(£) of the frame bundle is the set of all ordered bases in the fibers of 
the vector bundle E — M, with a suitable topology and manifold structure. A section 
of the frame bundle 7: Fr(E) — M over an open set U C M is a map s: U > Fr(E) 
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such that 7 os = lly, the identity map on U. From this point of view a frame 
e = |e, --- e,] over U is simply a section of the frame bundle Fr(EZ) over U. 

Suppose V is a connection on the vector bundle E — M. Miraculously, there 
exists a matrix-valued 1-form @ on the frame bundle Fr(Z) such that for every frame 
e over an open set U C M, the connection matrix @, of V is the pullback of w by the 
section e: U > Fr(E) (Theorem 29.10). This matrix-valued 1-form, called an Ehres- 
mann connection on the frame bundle Fr(£), is determined uniquely by the connec- 
tion on the vector bundle EF and vice versa. It is an intrinsic object of which a con- 
nection matrix @, is but a local manifestation. The frame bundle of a vector bundle 
is an example of a principal G-bundle for the group G = GL(r,R). The Ehresmann 
connection on the frame bundle generalizes to a connection on an arbitrary principal 
bundle. 


This section collects together some general facts 
about principal bundles. 


27.1 Principal Bundles 


Let E, M, and F be manifolds. We will denote an open 
cover St of M either as {U,,} or more simply as an unin- 
dexed set {U} whose general element is denoted by U. 
A local trivialization with fiber F for a smooth surjection 
m: E — M isan open cover Lt = {U} for M together with 
a collection {@y: 1 '(U) 4 U x F | U © St} of fiber- 
preserving diffeomorphisms @y : m~'(U) 4 U x F: 


Charles Ehresmann 


t7eP (1905-1979) 


US 


where 7) is projection to the first factor. A fiber bundle with fiber F is a smooth 
surjection 7: E — M having a local trivialization with fiber F. We also say that it 
is locally trivial with fiber F. The manifold E is the total space and the manifold M 
the base space of the fiber bundle. 

The fiber of a fiber bundle 2: E + M over x € M is the set E, := 1~!(x). Because 
7 is a submersion, by the regular level set theorem ((21], Th. 9.13, p. 96) each fiber 
E, is aregular submanifold of E. For x € U, define @y x := du|z,: Ex + {x} x F to 
be the restriction of the trivialization @y : a~'(U) > U x F to the fiber Ey. 


Proposition 27.1. Let t: E — M be a fiber bundle with fiber F. If dy: n~'!(U) > 
U X F is a trivialization, then oy x: Ey — {x} x F is a diffeomorphism. 
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Proof. The map @¢y,x is smooth because it is the restriction of the smooth map @¢y to 
a regular submanifold. It is bijective because dy is bijective and fiber-preserving. Its 
inverse bus is the restriction of the smooth map ¢,, 'UxFon! (U) to the fiber 
{x} x F and is therefore also smooth. 


A smooth right action of a Lie group G on a manifold M is a smooth map 
u:MxG—M, 
denoted by x- g := L(x, g), such that for all x € M and g,heEG, 
(i) x-e =x, where e is the identity element of G, 
(ii) (x-g)-h=x-(gh). 
We often omit the dot and write more simply xg for x- g. If there is such a map Ll, we 


also say that G acts smoothly on M on the right. A left action is defined similarly. 
The stabilizer of a point x € M under an action of G is the subgroup 


Stab(x) := {g € G| x-g =x}. 
The orbit of x € M is the set 
Orbit(x) :=xG := {x-g €M |g eG}. 


Denote by Stab(x)\G the set of right cosets of Stab(x) in G. By the orbit-stabilizer 
theorem, for each x € M the map: G > Orbit(x), g++ x-g induces a bijection of sets: 


Stab(x)\G <> Orbit(x), 
Stab(x)g <> x-g. 


The action of G on M is free if the stabilizer of every point x € M is the trivial 
subgroup {e}. 

A manifold M together with a right action of a Lie group G on M is called a right 
G-manifold or simply a G-manifold. A map f: N — M between right G-manifolds 
is right G-equivariant if 

f(x-8) =f) -8 
for all (x,g) € N x G. Similarly, a map f: N > M between left G-manifolds is left 
G-equivariant if 

f(g-x) =8- F(x) 
for all (g,x) € GX N. 

A left action can be turned into a right action and vice versa; for example, if G 

acts on M on the left, then 

Xx-g= g! “x 
is a right action of G on M. Thus, if N is a right G-manifold and M is a left 
G-manifold, we say a map f: N — M is G-equivariant if 


f(x-g) =f(x)-g=9 !- f(x) (27.1) 
for all (x,g) EN x G. 
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A smooth fiber bundle 7: P — M with fiber G is a smooth principal G-bundle 
if G acts smoothly and freely on P on the right and the fiber-preserving local 
trivializations 


dy: m'(U) >UXxG 


are G-equivariant, where G acts on U x G on the right by 


(x,h) ‘s= (x, hg). 


Example 27.2 (Product G-bundles). The simplest example of a principal G-bundle 
over a manifold M is the product G-bundle 1: M x G — M. A trivialization is the 
identity map on M x G. 


Example 27.3 (Homogenous manifolds). If G is a Lie group and H is a closed sub- 
group, then the quotient G/H can be given the structure of a manifold such that the 
projection map 7: G—> G/H isa principal H-bundle. This is proven in [22, Th. 3.58, 
p. 120]. 


Example 27.4 (Hopf bundle). The group S! of unit complex numbers acts on the 
complex vector space C’”*! by left multiplication. This action induces an action of 
S! on the unit sphere S*”*! in C"*+!. The complex projective space CP” may be 
defined as the orbit space of S?”*! by S!. The natural projection S?”+! — CP" with 
fiber S! turn out to be a principal S'-bundle. When n = 1, S? —> CP! with fiber S! is 
called the Hopf bundle. 


Definition 27.5. Let m9: Q — N and mp: P + M be principal G-bundles. A 
morphism of principal G-bundles is a pair of maps (f: Q@ + P,f: N — M) such 
that f: Q > P is G-equivariant and the diagram 


a 


commutes. 


Proposition 27.6. [f =: P — M is a principal G-bundle, then the group G acts tran- 
sitively on each fiber. 


Proof. Since G acts transitively on {x} x G and the fiber diffeomorphism @y x: Py > 
{x} x G is G-equivariant, G must also act transitively on the fiber P,. 


Lemma 27.7. For any group G, a right G-equivariant map f : G— G is necessarily 
a left translation. 


Proof. Suppose that for all x, g € G, 
f (x8) = fxg. 
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Setting x = e, the identity element of G, we obtain 


F (8) = Fle)g = ly(e) (8); 
where ¢/.): G— Gis left translation by f(e). 


Suppose {Uq}qea is a local trivialization for a principal G-bundle 7: P > M. 
Whenever the intersection Ugg := Ua Uz is nonempty, there are two trivializations 
on 2~' (Ugg): 

ae oe o 
Uap X Ge 1 '(Uop) mc Uap x G. 
Then oy © 5 i Ugg x G — Ugg X Gis a fiber-preserving right G-equivariant map. 
By Lemma 27.7, it is a left translation on each fiber. Thus, 


(ba ° Og ')(x,h) = (x, 8ap (x)h), (27.2) 


where (x,h) € Ugg x G and ggg(x) € G. Because $% © og is a C® function of x 
and h, setting h = e, we see that ggg (x) is a C® function of x. The C” functions 
8ap: Ugg — G are called transition functions of the principal bundle 7: P + M 
relative to the trivializing open cover {Ug }aca. They satisfy the cocycle condition: 
for alla, B, ye A, 


SupSpy=8ay if Up MUgNUy #2. 


From the cocycle condition, one can deduce other properties of the transition 
functions. 


Proposition 27.8. The transition functions gg of a principal bundle m: P — M rel- 
ative to a trivializing open cover {Uq}qaea Satisfy the following properties: for all 
a,BeEA, 


(i) aq = the constant map e, 


(ii) Sap = Spa fUaNUp # @. 
Proof. (i) If a = B =y, the cocycle condition gives 
Saa8aa = §aa- 


Hence, gqo, = the constant map e. 
(ii) if y= a, the cocycle condition gives 


§aBSBa = faa = €& 


or 


8aB = Sa for Ug. Ug AD, 


In a principal G-bundle P — M, the group G acts on the right on the total space P, 
but the transition functions gyg in (27.2) are given by left translations by gyg (x) € G. 
This phenomenon is a consequence of Lemma 27.7. 
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27.2 The Frame Bundle of a Vector Bundle 


For any real vector space V, let Fr(V) be the set of all ordered bases in V. Suppose 
V has dimension r. We will represent an ordered basis vj,...,v, by a row vector 
v=[v, --- v;], so that the general linear group GL(r,R) acts on Fr(V) on the right 
by matrix multiplication 


via=(|v--: v;][a'] 


= [Dud nel. 


Fix a point v € Fr(V). Since the action of GL(7,R) on Fr(V) is clearly transitive and 
free, i.e., Orbit(v) = Fr(V) and Stab(v) = {7}, by the orbit-stabilizer theorem there 
is a bijection 
GL(r,R) ; 
2 GL(7,R) = 2 <> Orbit(v) = Fr(V), 
de: GL(AR) = Fae + Onbit(v) = FHV) 
gi ve. 
Using the bijection $,, we put a manifold structure on Fr(V) in such a way that 9, 


becomes a diffeomorphism. 
If v’ is another element of Fr(V), then v’ = va for some a € GL(r,R) and 


va(g) = vag = by (ag) = () o fa)(g)- 


Since left multiplication €,: GL(r,R) — GL(7,R) is a diffeomorphism, the manifold 
structure on Fr(V) defined by @, is the same as the one defined by @,q. We call Fr(V) 
with this manifold structure the frame manifold of the vector space V. 


Remark 27.9. A linear isomorphism ¢: V — W induces a C™ diffeomorphism @ : 
Fr(V) > Fr(W) by 


dl vr] = (01) -- Or) 
Define an action of GL(7,R) on Fr(R”) by 
8-[vi -- r] = [evi --- avy. 


Thus, if @: R” > R’ is given by left multiplication by g € GL(7,R), then so is the 
induced map @ on the frame manifold Fr(R’). 


Example 27.10 (The frame bundle). Let 7: E — M be a C™ vector bundle of rank r. 
We associate to the vector bundle E a C® principal GL(r, R)-bundle 2: Fr(E) > M 
as follows. As a set the total space Fr(£) is defined to be the disjoint union 


Fr(E) = | | Fr(é,). 


xeM 


There is a natural projection map 7: Fr(E) > M that maps Fr(E,) to {x}. 
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A local trivialization 9: E|y,, + Ua x R’ induces a bijection 


a: Fr(E)|u, “ Us x Fr(R’), 
[vi ++ vy] © Fr(Ex) > (x, [box(v1) ++ borx(vr)])- 


Via ba one transfers the topology and manifold structure from Ug, x Fr(R") to 
Fr(E)|y,. This gives Fr(E) a topology and a manifold structure such that 
m: Fr(E) > M is locally trivial with fiber Fr(R”). As the frame manifold Fr(R") 
is diffeomorphic to the general linear group GL(r,R), it is easy to check that 
Fr(E) > M is a C® principal GL(r,R)-bundle. We call it the frame bundle of the 
vector bundle E. 

On a nonempty overlap Ugg := Ua Us, the transition function for the vector 
bundle E is the C® function ggg: Ugg + GL(7,R) given by 


$00 051: Ung xR’ > Ugg xR’, 
(ba ° dg ')(x,w) = (x, 8ap(*)¥). 


Since the local trivialization for the frame bundle Fr(£) is induced from the trivializa- 
tion {Uq, do} for E, the transition functions for Fr(Z) are induced from the transition 
functions {g¢g} for E. By Remark 27.9 the transition functions for the open cover 
{Fr(E)|u, } of Fr(Z) are the same as the transition functions ggg: Ugg + GL(7,R) 
for the vector bundle E, but now of course GL(7, R) acts on Fr(R") instead of on R’. 


27.3 Fundamental Vector Fields of a Right Action 


Suppose G is a Lie group with Lie algebra g and G acts smoothly on a manifold P 
on the right. To every element A € g one can associate a vector field A on P called 
the fundamental vector field on P associated to A: for p in P, define 


d 
A,=— 
“P dt 


D: ge TP. 
t=0 


To understand this equation, first fix a point p € P. Then c,: t+ p- e'4 is acurve 
in P with initial point p. By definition, the vector A, is the initial vector of this curve. 
Thus, 


d 
A, =, (0) = Cpe & ) €T,P. 


As a tangent vector at p is a derivation on germs of C™ functions at p, in terms of a 


C™ function f at p, 
d d 
A = ok — = — 
Arf = Cp (5 _) I= ail 


Proposition 27.11. For each A € g, the fundamental vector field A is C® on P. 


t=0 


f(p-e). 


t=0 


ha 
di 


fo Cp >= 
0 
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Proof. It suffices to show that for every C® function f on P, the function Af is also 
C™ on P. Let u: P x G— P be the C® map defining the right action of G on P. For 
any p in P, 

Ag=<| flp-e)=“) (Four) 

<=ps dt bo P ~~ dt i u P; * 


tA 


Since e is aC™ function of t, and f and yw are both C®, the derivative 


“(Fe my(p.e4) 


is C™ in p and int. Therefore, A, f is a C™ function of p. 


Recall that X(P) denotes the Lie algebra of C® vector fields on the manifold P. 
The fundamental vector field construction gives rise to a map 
O:g>X(P), ofA) :=A. 
For p in P, define j,: G— P by jp(g) = p-g. Computing the differential j,, using 
the curve c(t) = e’4, we obtain the expression 


p-eA=A (27.3) 


=Ay. 


Jpx(A) = 


di|,_ 


This alternate description of fundamental vector fields, A p= J px (A), shows that the 
map 0: g — X(P) is linear over R. In fact, o is a Lie algebra homomorphism 
(Problem 27.1). 


Example 27.12. Consider the action of a Lie group G on itself by right multiplication. 
For p € G, the map jp: G— G, jpy(g) = p-g = £)(g) is simply left multiplication 
by p. By (27.3), for A € g, A, = lp,(A). Thus, for the action of G on G by right 
multiplication, the fundamental vector field A on G is precisely the left-invariant 
vector field generated by A. In this sense the fundamental vector field of a right 
action is a generalization of a left-invariant vector field on a Lie group. 

For g in a Lie group G, let cg: G+ G be conjugation by g: cg(x) = gxg™!. 
The adjoint representation is defined to be the differential of the conjugation map: 
Ad(g) = (¢g)«: 9 > 9. 


Proposition 27.13. Suppose a Lie group G acts smoothly on a manifold P on the 
right. Let rg: P — P be the right translation rg(p) = p-g. For A € g the fundamental 
vector field A on P satisfies the following equivariance property: 


rgxA = (Adg™!)A. 
Proof. We need to show that for every p in P, rz.(A,) = (Ad g A, For x in G, 


(rg 0 jp) (x) = pxg = pgg”'xg = jpg(g”'x8) = Cipg 0 Cg-1)(X). 
By the chain rule, 


rgx(Ap) = Tex jp»(A) = Jpg (Cg-1)«(A) — Jpg ((Adg”')A) a (Adg !)A 


ps” 
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27.4 Integral Curves of a Fundamental Vector Field 


In this section suppose a Lie group G with Lie algebra g := Lie(G) acts smoothly on 
the right on a manifold P. 


Proposition 27.14. For p € P and A € g, the curve c)(t) = p-e4, t E R, is the 


integral curve of the fundamental vector field A through p. 


Proof. We need to show that c/,(t) = A, (r) for allt € R and all p € P. Itis essentially 
a sequence of definitions: 


d d A sA 
epi) a ds pee a das fe e = A peta = Ac, (t)- 
Proposition 27.15. The fundamental vector field A on a manifold P vanishes at a 
point p in P if and only if A is in the Lie algebra of Stab(p). 


Proof. (<=) If A € Lie(Stab(p)), then e € Stab(p), so 


tA d 


-e"=—| p=. 
1=0 dt 


t=0 


(=) Suppose A, = 0. Then the constant map y(t) = p is an integral curve of A 
through p, since 


7 (t)=0=A, = Ay). 


On the other hand, by Proposition 27.14, c,(t) = p- e'4 is also an integral curve of A 
through p. By the uniqueness of the integral curve through a point, c,(t) = y(t) 
or p-e4 = p for all t € R. This implies that e’4 € Stab(p) and therefore A € 
Lie(Stab(p)). 


Corollary 27.16. For a right action of a Lie group G on a manifold P, let p © P and 
Jp: G— P be the map jp(g) = p-g. Then the kernel ker jp, of the differential of jp 
at the identity 


Jp» = (ip)x,e: 9 > TpP 
is Lie( Stab(p)). 


Proof. For A € g, we have A, = jpx(A) by (27.3). Thus, 


A€ketjp, <= > jpx(A)=0 
—= A,=0 
<= A€Lie(Stab(p)) (by Proposition 27.15). 
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27.5 Vertical Subbundle of the Tangent Bundle TP 


Throughout this section, G is a Lie group with Lie algebra g and 7: P— Misa 
principal G-bundle. On the total space P there is a natural notion of vertical tangent 
vectors. We will show that the vertical tangent vectors on P form a trivial subbundle 
of the tangent bundle TP. 

By the local triviality of a principal bundle, at every point p € P the differential 
Tx,p: TpP — Ty(p)M of the projection 7 is surjective. The vertical tangent subspace 
V, C T,P is defined to be ker 7,,,. Hence, there is a short exact sequence of vector 
spaces 


0 Vp —+ TyP ~s Typ)M — 0, (27.4) 


and 
dimV, = dim7,P — dim T;;,)M = dimG. 


An element of V, is called a vertical tangent vector at p. 


Proposition 27.17. For any A € g, the fundamental vector field A is vertical at every 
point p € P. 


Proof. With j,: G— P defined as usual by j,(g) = p-g, 
(10 jp)(8) = 2(p-8) = 2p). 


Since A,, = jpx(A) by (27.3), and  o jp is a constant map, 


Tx,p(Apy) = (Hs,p 2 jpx)(A) = (Wo jp)s(A) = 0. 


Thus, in case P is a principal G-bundle, we can refine Corollary 27.16 to show 
that j,. maps g into the vertical tangent space: 


(ip)*e: 9 Vp CTP. 
In fact, this is an isomorphism. 


Proposition 27.18. For p € P, the differential at e of the map jy: G — P is an iso- 


morphism of g onto the vertical tangent space: jpx = (jp)xe: 9 as V>. 


Proof. By Corollary 27.16, ker jp, = Lie(Stab(p)). Since G acts freely on P, the 
stabilizer of any point p € P is the trivial subgroup {e}. Thus, ker j,,. = 0 and jp, is 
injective. By Proposition 27.17, the image j,, lies in the vertical tangent space V>. 
Since g and V, have the same dimension, the injective linear map j,,: g + V, has 
to be an isomorphism. 


Corollary 27.19. The vertical tangent vectors at a point of a principal bundle are 
precisely the fundamental vectors. 
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Let B,,...,Be be a basis for the Lie algebra g. By the proposition, the funda- 
mental vector fields B;,...,B¢ on P form a basis of V, at every point p € P. Hence, 
they span a trivial subbundle V := Lpep Vp of the sanaent bundle TP. We call V the 
vertical subbundle of TP. 

As we learned in Section 20.5, the differential 2,: TP — TM of a C® map 
7: P —> M induces a bundle map #,.: TP — 2*TM over P, given by 


TP—"__+ TM — TP 3 Xp (p,MepXp). 


Se 


The map 7, is surjective because it sends the fiber T,P onto the fiber (1*TM), ~ 
T;(p)M. Its kernel is precisely the vertical subbundle V by (27.4). Hence, V fits into 
a short exact sequence of vector bundles over P: 


0+ V—> TP n*TM 3 0. (27.5) 


27.6 Horizontal Distributions on a Principal Bundle 


On the total space P of a smooth principal bundle 7: P — M, there is a well-defined 
vertical subbundle V of the tangent bundle TP. We call a subbundle H of TP a 
horizontal distribution on P if TP = V ®@K as vector bundles; in other words, 7,P = 
V,+H, and V,1H, =0 for every p € P. In general, there is no canonically defined 
horizontal distribution on a principal bundle. 


A splitting of a short exact sequence of vector bundles 0 > A +B a C — 0 over 
a manifold P is a bundle map k: C — B such that j ok = Ic, the identity bundle map 
on Cc. 


Proposition 27.20. Let 
03A>B4C30 (27.6) 


be a short exact sequence of vector bundles over a manifold P. Then there is a 
one-to-one correspondence 


{subbundles H C B| B=i(A) ®@H} <— {splittings k: C > B of (27.6)}. 


Proof. If H is a subbundle of B such that B = i(A) @ H, then there are bundle iso- 
morphisms H ~ B/i(A) ~ C. Hence, C maps isomorphically onto H in B. This gives 
a splitting k: C > B. 

If k: C— Bisa splitting, let H := k(C), which is a subbundle of B. Moreover, if 
i(a) = k(c) for some a € A and c € C, then 


0 = ji(a) = jk(c) =c. 


Hence, i(A) Nk(C) = 0. 
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Finally, to show that B = i(A) + k(C), let b € B. Then 
i(b—kj(b)) = j(b) — i(b) =0. 
By the exactness of (27.6), b—kj(b) = i(a) for some a € A. Thus, 
b=i(a)+kj(b) € i(A)+K(C). 
This proves that B = i(A) +k(C) and therefore B = i(A) ®k(C). 


As we just saw in the preceding section, for every principal bundle 7: P + M 
the vertical subbundle V fits into a short exact sequence (27.5) of vector bundles over 
P. By Proposition 27.20, there is a one-to-one correspondence between horizontal 
distributions on P and splittings of the sequence (27.5). 


Problems 


27.1. Lie bracket of fundamental vector fields 
Let G be a Lie group with Lie algebra g and let P be a manifold on which G acts on the right. 
Prove that for A,B € g, 


[A,B] = |A, 8). 


Hence, the map o: g > X(P), A+> A is a Lie algebra homomorphism. 


27.2. Short exact sequence of vector spaces 

Prove that if 0 + A > B a C — 0 is a short exact sequence of finite-dimensional vector 
spaces, then dimB = dimA + dimC. 

27.3. Splitting of a short exact sequence 

Suppose 0 + A tas C — 0 is a short exact sequence of vector bundles over a manifold P. 
A retraction of i: A> Bisamapr: B 4 A such that roi = 1,4. Show that 7 has a retraction 


if and only if the sequence has a splitting. 


27.4° The differential of an action 

Let uw: P x G— P be an action of a Lie group G on a manifold P. For g € G, the tangent 
space T,G may be identified with ¢,..9, where ,: G — G is left multiplication by g € G and 
9g =TG is the Lie algebra of G. Hence, an element of the tangent space 7, ») (P x G) is of 
the form (Xp, +A) for Xp € TyP and A € g. Prove that the differential 


Hx = Ms,(p.g)! T(p.g)(P x G) + TpgP 


is given by 
Mx (Xp, lexA) = rgx(Xp) + Ang: 


27.5. Fundamental vector field under a trivialization 
Let dg: 27 'Ug 3 Ug x G 


¢a(p) = (2(r),8a(P)) 


be a trivialization of Us in a principal bundle P. Let A € g, the Lie algebra of G and A 
the fundamental vector field on P that it induces. Prove that 


8ax(Ap) = le4(p)(A) € Te, (p)(G)- 
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27.6. Trivial principal bundle 
Prove that a principal bundle 7: P — M is trivial if and only if it has a section. 


27.7. Pullback of a principal bundle to itself 
Prove that if 7: P — M is a principal bundle, then the pullback bundle z*P — P is trivial. 


27.8. Quotient space of a principal bundle 
Let G be a Lie group and H aclosed subgroup. Prove that if 7P — M is a principal G-bundle, 
then P + P/H is a principal H-subbundle. 


27.9. Fundamental vector fields 
Let N and M be G-manifolds with G acting on the right. If A € g and f: N > M is 
G-equivariant, then 

fx(Ang) = Ay f(q): 
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$28 Connections on a Principal Bundle 


Let G be a Lie group with Lie algebra g. As we saw in the preceding section, on 
a principal G-bundle P — M, the notion of a vertical tangent vector is well defined, 
but not that of a horizontal tangent vector. A connection on a principal bundle is 
essentially the choice of a horizontal complement to the vertical tangent bundle on P. 
Alternatively, it can be given by a g-valued 1-form on P. In this section we will study 
these two equivalent manifestations of a connection: 


(i) a smooth right-invariant horizontal distribution on P, 
(ii) a smooth G-equivariant g-valued 1-form @ on P such that on the fundamental 
vector fields, 
@(A)=A forallAcg. (28.1) 


Under the identification of g with a vertical tangent space, condition (28.1) says that 
@ restricts to the identity map on vertical vectors. 

The correspondence between (i) and (ii) is easy to describe. Given a right- 
invariant horizontal distribution F( on P, we define a g-valued 1-form @ on P to 
be, at each point p, the projection with kernel 3{, from the tangent space to the 
vertical space. Conversely, given a right-equivariant g-valued 1-form @ that is the 
identity on the vertical space at each point p € P, we define a horizontal distribution 
H on P to be kerq@, at each p € P. 


28.1 Connections on a Principal Bundle 


Let G be a Lie group with Lie algebra g, and let 7: P — M be a principal G-bundle. 
A distribution on a manifold is a subbundle of the tangent bundle. Recall that a 
distribution 3 on P is horizontal if it is complementary to the vertical subbundle V 
of the tangent bundle 7P: for all p in P, 


TP = Vy ®Hop. 


Suppose + is a horizontal distribution on the total space P of a principal 
G-bundle 7: P + M. For p € P, if jp: G— P is the map j»(g) = p-g, then the ver- 
tical tangent space V, can be canonically identified with the Lie algebra g via the 
isomorphism j,,: g — V, (Proposition 27.18). Let v: T,P =V,®KH, — V, be 
the projection to the vertical tangent space with kernel J{,. For Y, € TpP, v(Yp) is 
called the vertical component of Y,. (Although the vertical subspace V7, is intrinsi- 
cally defined, the notion of the vertical component of a tangent vector depends on 
the choice of a horizontal complement H,.) If @, is the composite 

-—1 
Wp := fol ov: ToP > Vp % g, (28.2) 
then @ is a g-valued 1-form on P. In terms of @, the vertical component of 
Y, € T,P is 
v(¥p) = jp» (@p(¥p)). (28.3) 
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Theorem 28.1. [f H{ is a smooth right-invariant horizontal distribution on the total 
space P of a principal G-bundle 1: P — M, then the g-valued I-form @ on P defined 
above satisfies the following three properties: 

(i) for any A € g and p € P, we have @,(A,) =A; 

(ii) (G-equivariance) for any g € G, r,@ = (Ad ga; 
(iii) @ is C™. 


Proof. (i) Since A, is already vertical (Proposition 27.17), the projection v leaves it 
invariant, so 


@p(A,) = la (v(A,)) = os (A,) =A. 
(ii) For p € P and Y, € T,,P, we need to show 
Ope (Tex¥p) = (Adg')@p(¥p). 


Since both sides are IR-linear in Y, and Y, is the sum of a vertical and a horizontal 
vector, we may treat these two cases separately. 
If Y, is vertical, then by Proposition 27.18, Y, =A, for some A € g. In this case 


Ope ("gAy) = Ope ((Adg”')A,.) (by Proposition 27.13) 


=(Adg')A (by (i) 
=(Adg"')@,(A,) (by (i) again). 


If Y, is horizontal, then by the right-invariance of the horizontal distribution 1, 
SO is rg,Y,. Hence, 


Ope (Tex¥p) = 0 = (Adg™')@p(Yp). 


(iii) Fix a point p € P. We will show that @ is C™ in a neighborhood of p. Let 
B,,...,Bz be a basis for the Lie algebra g and B,,...,By the associated fundamental 
vector fields on P. By Proposition 27.11, these vector fields are all C” on P. Since H 
is aC™ distribution on P, one can find a neighborhood W of p and C™ horizontal vec- 
tor fields X1,...,X, on W that span H at every point of W. Then By,...,B¢,X1,...,Xn 
is aC™ frame for the tangent bundle TP over W. Thus, any C® vector field X on W 
can be written as a linear combination 


X=) a'Bit+ DY b/x; 
with C® coefficients a',b/ on W. By the definition of , 


o(X) = 0 (Sa‘B;) = > aB;. 


This proves that @ is a C™ 1-form on W. 
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Note that in this theorem the proof of the smoothness of w requires only that the 
horizontal distribution F( be smooth; it does not use the right-invariance of H. 


Definition 28.2. An Ehresmann connection or simply a connection on a principal 
G-bundle P — M is a g-valued 1-form w on P satisfying the three properties of 
Theorem 28.1. 


A g-valued 1-form a on P can be viewed as a map a: TP — g from the tangent 
bundle 7P to the Lie algebra g. Now both TP and g are G-manifolds: the Lie group 
G acts on TP on the right by the differentials of right translations and it acts on g on 
the left by the adjoint representation. By (27.1), a: TP — g is G-equivariant if and 
only if for all p € P, Xp € T,P, and g € G, 


a(Xp,-g) = ia O(Xp), 


or 
O(rgxXp) = (Adg”')or(Xp). 


Thus, @: TP — g is G-equivariant if and only if ry = (Adg—!)a for all g € G. 
Condition (ii) of a connection @ on a principal bundle says precisely that @ is G- 
equivariant as a map from TP to g. 


28.2 Vertical and Horizontal Components of a Tangent Vector 


As we noted in Section 27.5, on any principal G-bundle 7 : P — M, the vertical 
subspace V, of the tangent space 7,,P is intrinsically defined: 


V,:=kernt, : T,P > Ty (p)M. 


By Proposition 27.18, the map j,, naturally identifies the Lie algebra g of G with the 
vertical subspace V ,. 

In the presence of a horizontal distribution on the total space P of a principal 
bundle, every tangent vector Y, € T,,P decomposes uniquely into the sum of a vertical 
vector and a horizontal vector: 


Y, =v(¥p) +h(¥,) € Vp @ Hp. 


These are called, respectively, the vertical component and horizontal component of 
the vector Y,. As p varies over P, this decomposition extends to a decomposition of 
a vector field Y on P: 

Y¥ =v(Y) +A(Y). 


We often omit the parentheses in v(Y) and A(Y), and write vY and AY instead. 
Proposition 28.3. [fF is a C™ horizontal distribution on the total space P of a prin- 


cipal bundle, then the vertical and horizontal components v(Y) and h(Y) of a C® 
vector field Y on P are also C™. 
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Proof. Let @ be the g-valued 1-form associated to the horizontal distribution F( 
by (28.2). It is C” by Theorem 28.1(iii). In terms of a basis B,...,Be for g, we 
can write @ = Y w'B;, where w! are C® 1-forms on P. If Y, € T,P, then by (28.3) its 
vertical component v(Y,) is 


V(Yp) = jpx(@p(Yp)) = Jpx (> oo}, (Y)Bi) = ¥ 0}, (Yp)(Bi)p- 
As p varies over P, 
v(Y) =} ao! (Y)Bi. 


Since ', Y, and B; are all C”, so is v(Y). Because h(Y) = Y — v(Y), the horizontal 
component h(Y) of a C® vector field Y on P is also C®. 


On a principal bundle 7: P + M, if rz: P — P is right translation by g € G, then 
1 org = 1. It follows that 7, o rg, = 1,. Thus, the right translation r,, : T,P + TyeP 
sends a vertical vector to a vertical vector. By hypothesis, rg.3() = Hpg and hence 
the right translation rz, also sends a horizontal vector to a horizontal vector. 


Proposition 28.4. Suppose H is a smooth right-invariant horizontal distribution on 
the total space of a principal G-bundle m: P — M. For each g € G, the right trans- 
lation rg, commutes with the projections v and h. 


Proof. Any X, € T,P decomposes into vertical and horizontal components: 
Xp = V(Xp) +h(Xp). 
Applying rg, to both sides, we get 
TexXp = VexV(Xp) + rexh(Xp). (28.4) 


Since rg, preserves vertical and horizontal subspaces, rg,.v(Xp) is vertical and rg,.h(Xp) 
is horizontal. Thus, (28.4) is the decomposition of rz,X, into vertical and horizontal 
components. This means for every Xp € TpP, 


Vr gx(Xp) =Fexv(Xp) and hrgs (Xp) = rgxh(Xp). 


28.3 The Horizontal Distribution of an Ehresmann Connection 


In Section 28.1 we showed that a smooth, right-invariant horizontal distribution on 
the total space of a principal bundle determines an Ehresmann connection. We now 
prove the converse. 


Theorem 28.5. /f @ is a connection on the principal G-bundle ™: P — M, then 
Hp := ker @p, p € P, is a smooth right-invariant horizontal distribution on P. 


Proof. We need to verify three properties: 


(i) At each point p in P, the tangent space 7,P decomposes into a direct sum 
TpP = Vn BH. 
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(ii) For p € Pand g € G, rgx.(FLp) C Hyg. 
(iii) FC is aC™ subbundle of the tangent bundle TP. 


(i) Since H{, = ker Wp, there is an exact sequence 


03H, 1,P 494-0. 


The map jp+: 9 - Vy C T,P provides a splitting of the sequence. By Proposi- 
tion 27.20, there is a sequence of isomorphisms 


TpP ~ 9 DHy ~V~ PH. 


(ii) Suppose Y, € 1H, = ker @,. By the right-equivariance property of an Ehresmann 
connection, 

Ope (Tex¥p) = (12) p(¥p) = (Adg”')@,(¥,) = 0. 
Hence, rgxYp € Hyg. 
(iii) Let By,...,By be a basis for the Lie algebra g of G. Then @ = ¥, @'B;, where 
w!,...,@° are smooth R-valued 1-forms on P and for DEP, 


L 
Hp = () ker ,- 
i=l 
Since @,: T,P — g is surjective, w!,...,@° are linearly independent at p. 
Fix a point p € P and let x!,...,x”” be local coordinates near p on P. Then 


m 


Wo S fae, £3 Vat 
j=l 


for some C™ functions /' ‘ in a neighborhood of p. 


Let b!,...,b” be the fiber coordinates of TP near p, i.e., if vq © T,P for q near p, 
then 


- 0 
= j 
ey ele 
oxi y 
In terms of local coordinates, 


L 
Ho = ()ker aj = {vq € T,P | @) (vq) =0,i=1,...,¢} 
i=l 


m 
_ 1 H i f(y, ge 
= {(b',...,b") ER” | Dall Oyt asic) 
= 
Let F'(g,b) = DL, fi(g)b!, i=1,...,€. Since w!,...,@° are linearly indepen- 


dent at p, the Jacobian matrix [dF'/db/| = [fil. an € x m matrix, has rank £ at p. 
Without loss of generality, we may assume that the first ¢ x £ block of [ fi( p)| has 
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rank @. Since having maximal rank is an open condition, there is a neighborhood 
U, of p on which the first x £ block of [ fil has rank @. By the implicit function 


theorem, on Up, b!, a ,b! are C™ functions of pet), ...,D”, say 


pb! - bp! ‘Cane = ,b", 


bf = b'(b*!,...,b”). 


Let 
x= 5i(1,0,.. py? 
ee oe Oar 
Bed O O 
x= Yb’ (0,1,0,...,0)— + — > 
: 2 (0,1,0,--- lagi + Fee 
x, = b/0,0,... i232. 
m— a ede} ’ oxi oxin 


These are C™ vector fields on U, that span J, at each point g € U,. By the subbundle 
criterion (Theorem 20.4), H{ is aC” subbundle of 7'P. 


28.4 Horizontal Lift of a Vector Field to a Principal Bundle 


Suppose J is a horizontal distribution on a principal bundle 7: P - M. Let X bea 

vector field on M. For every p € P, because the vertical subspace V, is ker7,, the 
differential 7,.: Tp>P — Tz(p)M induces an isomorphism 

T,P 

ker 7, 


S Hp > TrpyM 


of the horizontal subspace J, with the tangent space 7,,,)M. Consequently, there is 
a unique horizontal vector X, € J, such that 2,(X,) = Xn(p) © Tr(p)M. The vector 
field X is called the horizontal lift of X to P. 


Proposition 28.6. [f FC is a C™ right-invariant horizontal distribution on the total 
space P of a principal bundle x: P — M, then the horizontal lift X of a C” vector 
field X on M is a C™ right-invariant vector field on P. 


Proof. Letx € M and p € x~!(x). By definition, 7, (X,) =X. If q is any other point 
of 2~!(x), then g = pg for some g € G. Since Torg =7, 


filha X= (8 oT; Xp = SA. 


By the uniqueness of the horizontal lift, rg.X,) = Xx, pe Lhis proves the right-invariance 
of X. 
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We prove the smoothness of X by proving it locally. Let {U} be a trivializing 
open cover for P with trivializations @y : 2~!(U) + U x G. Define 


Z(x,¢) = (Xx,0) € Ty.) (U x G). 


Let 7: U x GU be the projection to the first factor. Then Z is a C™ vector field 
on U x G such that 9..Z(x,2) = Xx, and Y := (@y)~'Z is aC” vector field on 2~'(U) 
such that 7.¥, = Xq(p). By Proposition 28.3, hY is a C™ vector field on nm '(U). 
Clearly it is horizontal. Because Y, = v(Y,) +/A(Y,) and 2,v(Y,) =0, we have 2,.Y, = 
t,h(¥,) = Xq(p)- Thus, hY lifts X over U. By the uniqueness of the horizontal lift, 
hY = X over U. This proves that X is a smooth vector field on P. 


28.5 Lie Bracket of a Fundamental Vector Field 


If a principal bundle P comes with a connection, then it makes sense to speak of hor- 
izontal vector fields on P; these are vector fields all of whose vectors are horizontal. 


Lemma 28.7. Suppose P is a principal bundle with a connection. Let A be the fun- 
damental vector field on P associated to A € g. 


(i) If Y is a horizontal vector field on P, then {A,Y] is horizontal. 
(ii) If Y is a right-invariant vector field on P, then [A,Y| = 0. 


Proof. (i) A local flow for A is ¢;(p) = pe’ = r.za(p) (Proposition 27.14). By the 
identification of the Lie bracket with the Lie derivative of vector fields [21, Th. 20.4, 
p. 225] and the definition of the Lie derivative, 


(r,-1a de YnetA = Yp 


A,Y|,=(LaY)p =i 28. 
(A, ¥]p = (Ca¥)p = lim (8.5) 
Since right translation preserves horizontality (Theorem 28.5), both (7,14), Yea and 


Y, are horizontal vectors. Denote the difference quotient in (28.5) by c(t). For every 
t near 0 in R, c(f) is in the vector space (, of horizontal vectors at p. Therefore, 
[A,Y]p =lim;4oc(t) € Hp. 


(ii) If Y is right-invariant, then 


(rea )a¥ pet =Y. 


In that case, it follows from (28.5) that [A,Y], = 0. 


Problems 


28.1. Maurer—Cartan connection 

If @ is the Maurer—Cartan form on a Lie group and 22: M x G — G is the projection to the 
second factor, prove that @ := 7; @ is a connection on the trivial bundle 2, : M x G— M. It is 
called the Maurer—Cartan connection. 
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28.2. Convex linear combinations of connections 
Prove that a convex linear combination @ of connections @,,...,@, on a principal bundle 
m: P — M is again a connection on P. (@ = Y/A;@;, YA; = 1, 4; > 0.) 


28.3. Pullback of a connection 
Let #9: Q > N and mp: P — M be principal G-bundles, and let (f: Q > P, f: N+ M) bea 
morphism of principal bundles. Prove that if 6 is a connection on P, then f*@ is a connection 


on Q. 
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$29 Horizontal Distributions on a Frame Bundle 


In this section we will explain the process by which a connection V on a vector bundle 
E over a manifold M gives rise to a smooth right-invariant horizontal distribution on 
the associated frame bundle Fr(E£). This involves a sequence of steps. A connection 
on the vector bundle E induces a covariant derivative on sections of the vector bundle 
along a curve. Parallel sections along the curve are those whose derivative vanishes. 
Just as for tangent vectors in Section 14, starting with a frame e, for the fiber of 
the vector bundle at the initial point x of a curve, there is a unique way to parallel 
translate the frame along the curve. In terms of the frame bundle Fr(£), what this 
means is that every curve in M has a unique lift to Fr(Z) starting at e, representing 
parallel frames along the curve. Such a lift is called a horizontal lift. The initial 
vector at e, of a horizontal lift is a horizontal vector at e,. The horizontal vectors at 
a point of Fr(£) form a subspace of the tangent space 7.,(Fr(E)). In this way we 
obtain a horizontal distribution on the frame bundle. We show that this horizontal 
distribution on Fr(£) arising from a connection on the vector bundle E is smooth and 
right-invariant. It therefore corresponds to a connection @ on the principal bundle 
Fr(Z). We then show that @ pulls back under a section e of Fr(Z) to the connection 
matrix @, of the connection V relative to the frame e on an open set U. 


29.1 Parallel Translation in a Vector Bundle 


In Section 14 we defined parallel translation of a tangent vector along a curve in a 
manifold with an affine connection. In fact, the same development carries over to an 
arbitrary vector bundle 1: E — M with a connection V. 

Let c: [a,b] + M be a smooth curve in M. Instead of vector fields along the 
curve c, we consider smooth sections of the pullback bundle c*E over [a,b]. These 
are called smooth sections of the vector bundle E along the curve c. We denote by 
T(c*E) the vector space of smooth sections of E along the curve c. If E = TM is the 
tangent bundle of a manifold M, then an element of I'(c*TM) is simply a vector field 
along the curve c in M. Just as in Theorem 13.1, there is a unique R-linear map 


D 

—: T(E) 9 T(E 

—: T(c*E) +1 (c"E), 
called the covariant derivative corresponding to V, such that 


(i) (Leibniz rule) for any C* function f on the interval [a, b], 


D(fs) df Ds_ 
dt a ge 


(ii) if s is induced from a global section § € T(M, E) in the sense that s(t) = 5(c(t)), 
then 


Ds 


HE (t) = Vers: 
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Definition 29.1. A section s € T'(c*E) is parallel along a curve c: |a,b| > M if 
Ds/dt = 0 on [a,b]. 


As in Section 14.5, the equation Ds/dt = 0 for a section s to be parallel is 
equivalent to a system of linear first-order ordinary differential equations. Suppose 
c: [a,b] + M maps into a framed open set (U,e1,...,e,) for E. Then s € T(c*E) can 
be written as 


s(t) = Vis" eice: 


By properties (i) and (ii) of the covariant derivative, 


Ds ds' -D 
ee WE BN gl 
dt dt ' ds dt il 
ds' ; 
=> 54 +8! V oe; 
j 
= oe + Ysa} (c'(t))e; 


Hence, Ds/dt = 0 if and only if 
a +¥'s/@}(c!(t)) = 0 for all i. 
dt ; J 


This is a system of linear first-order differential equations. By the existence and 
uniqueness theorems of differential equations, it has a solution on a small interval 
about a give point fg and the solution is uniquely determined by its value at to. Thus, 
a parallel section is uniquely determined by its value at a point. If s ¢ T(c*E) isa 
parallel section of the pullback bundle c*E, we say that s(b) is the parallel transport 
of s(a) along c: [a,b] + M. The resulting map: E.(q) > E,(p) is called parallel 
translation from E,(q) to E.(y)- 


Theorem 29.2. Let 1: E — M be a C™ vector bundle with a connection V and let 
c: [a,b] > M be a smooth curve in M. There is a unique parallel translation Qq p 
from Eq) to E,(p) along c. This parallel translation @q,p: E(qa) — Ec(p) is a linear 
isomorphism. 


The proof is similar to that of Theorem 14.14. 

A parallel frame along the curve c: [a,b] > M is a collection of parallel sections 
(e1(t),...,e-(t)), ¢ € [a,b], such that for each f, the elements ¢;(t),...,e-(t) form a 
basis for the vector space E,;). 

Let z: Fr(E) — M be the frame bundle of the vector bundle 7: E — M. A curve 
€(t) in Fr(E) is called a lift of the curve c(t) in M if c(t) = m(€(t)). It is a horizontal 
lift if in addition é(t) is a parallel frame along c. 

Restricting the domain of the curve c to the interval [a,t], we obtain from Theo- 
rem 29.2 that parallel translation is a linear isomorphism of E,(q) with E,(,). Thus, if 
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a collection of parallel sections (s)(t),...,s,(t)) € I'(c*E) forms a basis at one time 
t, then it forms a basis at every time t € [a,b]. By Theorem 29.2, for every smooth 
curve c: [a,b] + M and ordered basis (51,.0,.-., 5,0) for E,(q), there is a unique par- 
allel frame along c whose value at a is (s1,0,...,5-0). In terms of the frame bundle 


Fr(E), this shows the existence and uniqueness of a horizontal lift with a specified 
initial point in Fr(£) of a curve c(t) in M. 


29.2 Horizontal Vectors on a Frame Bundle 


On a general principal bundle vertical vectors are intrinsically defined, but horizontal 
vectors are not. We will see shortly that a connection on a vector bundle E over a 
manifold M determines a well-defined horizontal distribution on the frame bundle 
Fr(E). The elements of the horizontal distribution are the horizontal vectors. Thus, 
the notion of a horizontal vector on the frame bundle Fr(Z) depends on a connection 
on E. 


Definition 29.3. Let E — M be a vector bundle with a connection V, x € M, and 
ex € Fr(E,). A tangent vector v € T,, (Fr(Z)) is said to be horizontal if there is a curve 
c(t) through x in M such that v is the initial vector é’(0) of the unique horizontal lift 
of (t) of c(t) to Fr(Z) starting at e,. 


Our goal now is to show that the horizontal vectors at a point e, of the frame 
bundle form a vector subspace of the tangent space Te, (Fr(E ve To this end we 
will derive an explicit formula for é(0) in terms of a local frame for E. Suppose 
c: [0,5] > M is a smooth curve with initial point c(0) = x, and é(t) is its unique 
horizontal lift to Fr(£) with initial point ey = (e1,.0,...,e,0). Let s be a frame for E 
over a neighborhood U of x with s(x) = ey. Then s(c(t)) is a lift of c(t) to Fr(Z) with 
initial point e,, but of course it is not necessarily a horizontal lift (see Figure 29.1). 
For any ¢ € [0,5], we have two ordered bases s(c(t)) and é(t) for E,(7), so there is a 
smooth matrix a(t) € GL(r,R) such that s(c(t)) = é(t)a(t). Att = 0, s(c(0)) =e, = 
€(0), so that a(0) = /, the identity matrix in GL(7,R). 


Lemma 29.4. In the notation above, let s,: T,.(M) — Te, (Fr(E)) be the differential 
of s and a'(0) the fundamental vector field on Fr(E) associated to a'(0) € gl(7,R). 
Then 


Proof. Let P = Fr(E) and G = GL(7,R), and let uw: P x G > P be the right action 
of G on P. Then 


s(c(t)) = é(t)a(t) = wet), a(z)), (29.1) 
with c(0) = x, €(0) = e,, and a(0) = the identity matrix /. Differentiating (29.1) with 


respect to ¢ and evaluating at 0 gives 


8x(c'(0)) = Lx (e(0),a(0)) (2 (0), a’(0)). 
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Fig. 29.1. Two liftings of a curve 


By the formula for the differential of an action (Problem 27.4), 


“6, J 


8«(c'(0)) = ra(o)x@’ (0) +4’) 4) = (0) +a'(0) 


Lemma 29.5. Let E — M be a vector bundle with a connection V. Suppose s = 


(s1,--..,8,) is a frame for E over an open set U, €(t) a parallel frame over a curve c(t) 
in U with ¢(0) = s(c(0)), and a(t) the curve in GL(r,R) such that s(c(t)) = é(t)a(t). 
If @s = [5] is the connection matrix of V with respect to the frame (5),...,Sr) over 


U, then a’ (0) = a@,(c'(0)). 


Proof. Label c(0) = x and é;(0) = 5; (c(0)) = e;,x. By the definition of the connection 
matrix, 


V.(0)87 = > 1 (c'(0)) si(e(0)) = Yo} (c'(0)) ei.x- (29.2) 


On the other hand, by the defining properties of the covariant derivative 
(Section 29.1), 


V.(0)8j = Yai)! Oeix. (29.3) 


Thus, Lemma 29.4 for the horizontal lift of c’(0) can be rewritten in the form 


é’(0) = s,(c/(0)) — a'(0) = 5, (¢'(0)) — @, (0), : (29.4) 


ey Me 
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Proposition 29.6. Let 7: E — M be a smooth vector bundle with a connection over 
a manifold M of dimension n. For x € M and e, an ordered basis for the fiber E,, the 
subset He, of horizontal vectors in the tangent space T.,(Fr(E)) is a vector space of 
dimension n, and T,.: He, —> TM is a linear isomorphism. 


Proof. In formula (29.4), @;(c’(0)) is R-linear in its argument c’(0) because @, is 
a 1-form at c(0). The operation A ++ A,. of associating to a matrix A € gl(7,R) a 
tangent vector A, € To, (Fr(E )) is R-linear by (27.3). Hence, formula (29.4) shows 
that the map 


¢: TM — T.,(Fr(E)), 
(0) H &(0) 


is IR-linear. As the image of a vector space TM under a linear map, the set H{., of 
horizontal vectors @ (0) at e, is a vector subspace of T., (Fr(E)). 

Since 2(@(t)) = c(t), taking the derivative at t = 0 gives m.(@(0)) =c'(0), so 
7, is a left inverse to the map @. This proves that @: T,.M — T,., (Fr(E )) is injective. 
Its image is by definition H{,,. It follows that @: T,.M — H_, is an isomorphism with 
inverse 7,: He, + Te,M. 


29.3 Horizontal Lift of a Vector Field to a Frame Bundle 


We have learned so far that a connection on a vector bundle E — M defines a hor- 
izontal subspace J{, of the tangent space 7,,P at each point p of the total space of 
the frame bundle 7: P = Fr(E) — M. The horizontal subspace H{, has the same 
dimension as M. The vertical subspace V, of 7,,P is the kernel of the surjection 
1: TpP — Ty(p)M; as such, dimV, = dim7,P —dimM. Hence, V, and 1, have 
complementary dimensions in 7,P. Since 7,(V,) = 0 and 1,: Hp — Typ)M is an 
isomorphism, V, 3, = 0. It follows that there is a direct sum decomposition 


Ty(Fr(E)) = Vp @ Hp. (29.5) 


Our goal now is to show that as p varies in P, the subset H :=U, <p Hp of the tangent 
bundle 7P defines a C™ horizontal distribution on P in the sense of Section 27.6. 

Since 1x,» : Hp — Tyz(p)M is an isomorphism for each p € P, if X is a vector field 
on M, then there is a unique vector field X on P such that X, € H, and 7,, sip) = 
Xq(p)- The vector field X is called the horizontal lift of X to the frame bundle P. 

Since every tangent vector X,; € T,M is the initial vector c'(0) of a curve c, for- 
mula (29.4) for the horizontal lift of a tangent vector can be rewritten in the following 
form. 


Lemma 29.7 (Horizontal lift formula). Suppose V is a connection on a vector bun- 
dle E — M and Qs, is its connection matrix on a framed open set (U,s). For x € U, 
p = s(x) € Fr(E), and X, € T,M, let X, be the horizontal lift of X; to p in Fr(E). 
Then 


Xp = Sxx(Xx) — Os(Xx) .- 
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Proposition 29.8. Let E — M be a C® rank r vector bundle with a connection and 
m: Fr(E) > M its frame bundle. If X is a C® vector field on M, then its horizontal 
lift X to Fr(E) is a C® vector field. 


Proof. Let P = Fr(£) and G= GL(r,R). Since the question is local, we may assume 
that the bundle P is trivial, say P = M x G. By the right invariance of the horizontal 
distribution, 
X(x,a) = FaxX(x,1)- (29.6) 
Lets: M— P=M x Gbe the section s(x) = (x, 1). By the horizontal lift formula 
(Lemma 29.7), 


Xe) = Sx x(Xx) a @;(X;) (29.7) 


(x1)" 
Let p = (x,a) € P and let f be a C® function on P. We will prove that X,f is C® as 
a function of p. By (29.6) and (29.7), 


Xpf = TaxSx,x(Xx)f — Fax Os(Xx) oo ts (29.8) 


so it suffices to prove separately that (ras (Sex X;)) f and (fx @s(Xx) on 1) f are C* 


functions on P. 
The first term is 


(PasSea Xe) ) F =A f oe) 
=X(forgos)(x(p)) 
=X(f(s(x(p))a)) =X (F(u(s(a(P)),4))) 
=X(f(u(s(x(p)),2(p)))), (29.9) 
where tu: P x G— Pis the action of G on P and 12: P= M x G— Gis the projection 
72 (p) = M%(x,a) =a. The formula (29.9) expresses (TeeSiex GF as a C™ function 
on P. 


By the right equivariance of the connection form @,, in (29.8) the second term 
can be rewritten as 


Vax Ws (X, Os(Xx) pf = (Ada )o.(X) yf 
= (Adm(p)')@s(Xn(n)) Ff 


where (Adz2(p)~') @,(Xq(p)) is aC® function: P + gl(r,R) that we will denote by 
A(p). The problem now is to show that p> A(p) ,f is aC function of p. 


Let uw: Px G— P be the right action of G = GL(7,R) on P = Fr(E). Then 


Ay) f=] p(y) = 4] p(u(p,e4)). 


7 dt t=0 dt t=0 


Since f, 1, A, and the exponential map are all C* functions, A( P), f isaC® function 


of p. Thus, Mat in (29.8) is a C® function of p. This proves that X is a C® vector 
field on P. 
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Theorem 29.9. A connection V on a smooth vector bundle E — M defines a C” 
distribution H on the frame bundle nm: P = Fr(E) > M such that at any p € P, 


(i) TP =Vp@Hp; 
(ii) rgx(Hp) = Hyg for any g © G=GL/(r,R), 


where rz: P — P is the right action of G on P. 


Proof. To prove that H{ is a C~ subbundle of TP, let U be a coordinate open set in 
M and 51,...,5, aC” frame on U. By Proposition 29.8 the horizontal lifts 5],..., 5, 
are C® vector fields on U := 27! (U). Moreover, for each p € U, since Tx,p: Hp 
Tyz(p)M is an isomorphism, (51)p,---;(Sn)p form a basis for F{,. Thus, over U the 
C® sections 51,...,5, of TP span H. By Theorem 20.4, this proves that H is a C® 
subbundle of TP. 

Equation (29.5) establishes (i). 

As for (ii), let €’(0) € H,, where c(t) is a curve in M and é(t) = [vi (t) --- v-(t)] 
is its horizontal lift to P with initial point p. Here we are writing a frame as a row 
vector so that the group action is simply matrix multiplication on the right. For any 
g=(gi] € GL(,R), 


a(t)g = [Vigivilt) --- Vegi vi(t)] - 


Since Dv; /dt = 0 by the horizontality of v; and gi are constants, D(>, givi) /dt =0. 
Thus, €(t)g is the horizontal lift of c(t) with initial point €(0)g. It has initial tangent 
vector 

d 


—| E(t) g = rgx€" (0) € Hyg. 
dt |,—0 


This proves that rg. J5t, C Hyg. Because rz: Hp, — Hyg has a two-sided inverse 
it is bijective. In particular, rg.) = Hyg. 


Po-ly> 


29.4 Pullback of a Connection on a Frame Bundle 
Under a Section 


Recall that a connection V on a vector bundle E can be represented on a framed open 
set (U,e1,...,e,) for E by a connection matrix @,. depending on the frame. Such a 
frame e = (e1,...,@,) is in fact a section e: U + Fr(E£) of the frame bundle. We now 
use the horizontal lift formula (Lemma 29.7) to prove that the Ehresmann connection 
@ on the frame bundle Fr(£) determined by V pulls back under the section e to the 
connection matrix @,. 


Theorem 29.10. Let V be a connection on a vector bundle E — M and let @ be the 
Ehresmann connection on the frame bundle Fr(E) determined by V. If e = (e1,...,ér) 
is a frame for E over an open set U, viewed as a section e: U + Fr(E)|y, and @ is 
the connection matrix of V relative to the frame e, then We = e* o. 
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Proof. Let x € U and p = e(x) € Fr(E). Suppose X,, is a tangent vector to M at x. If 
we write @,, for the value of the connection matrix @, at the point x € U, then @ x 
is an r x r matrix of 1-forms at x and @, .(X,) is an r x r matrix of real numbers, i.e., 
an element of the Lie algebra gl(r,R). The corresponding fundamental vector field 
on Fr(£) is @e.(X,). By Lemma 29.7, the horizontal lift of X, to p € Fr(E) is 


xX = eX, — e.x(Xx) | 
Applying the Ehresmann connection @, to both sides of this equation, we get 
0 = (Xp) = @p(€,Xx) — Op (eae(X),) 
= (2"@p) (Xx) — @ex(X) (by Theorem 28.1(i)). 


Since this is true for all X, € 7,M, 


C* Wp = (C°W)x = Mex. 
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Let G be a Lie group with Lie algebra g. Associated to a connection @ on a principal 
G-bundle is a g-valued 2-form Q called its curvature. The definition of the curvature 
is suggested by the second structural equation for a connection V on a vector bundle 
E. Just as the connection form @ on the frame bundle Fr(£Z) pulls back by a section e 
of Fr(Z) to the connection matrix @, of V with respect to the frame e, so the curvature 
form © on the frame bundle Fr(Z) pulls back by e to the curvature matrix Q, of V 
with respect to e. Thus, the curvature form Q on the frame bundle is an intrinsic 
object of which the curvature matrices Q, are but local manifestations. 


30.1 Curvature Form on a Principal Bundle 


By Theorem 11.1 if V is a connection on a vector bundle E — M, then its connection 
and curvature matrices @, and Q, on a framed open set (U,e) = (U,e1,...,e,) are 
related by the second structural equation (Theorem | 1.1) 


Q. = d@e + Me \ Qe. 


In terms of the Lie bracket of matrix-valued forms (see (21.12)), this can be rewrit- 
ten as 


1 
Q. =d@e+ 5 |e, We]. 


An Ehresmann connection on a principal bundle is Lie algebra-valued. In a general 
Lie algebra, the wedge product is not defined, but the Lie bracket is always defined. 
This strongly suggests the following definition for the curvature of an Ehresmann 
connection on a principal bundle. 


Definition 30.1. Let G be a Lie group with Lie algebra g. Suppose @ is an Ehres- 
mann connection on a principal G-bundle 7: P — M. Then the curvature of the 
connection @ is the g-valued 2-form 


1 
Q=do+ 710, o]. 


Recall that frames for a vector bundle E over an open set U are sections of the 
frame bundle Fr(£Z). Let @ be the connection form on the frame bundle Fr(£) det- 
ermined by a connection V on E. In the same way that w pulls back by sections of 
Fr(Z) to connection matrices, the curvature form Q of the connection @ on Fr(E) 
pulls back by sections to curvature matrices. 


Proposition 30.2. [f V is a connection on a vector bundle E — M and @ is the asso- 
ciated Ehresmann connection on the frame bundle Fr(E), then the curvature matrix 
Q, relative to a frame e = (e1,...,é,) for E over an open set U is the pullback e*Q 
of the curvature Q on the frame bundle Fr(E). 
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1 
CO=edat+ ao [o, o| 


1 
=de*o+ 5 [e*@,e*@| (e* commutes with d and [ , | by Proposition 21.8) 


1 
=da,.+ 5 [We, We] (by Theorem 29.10) 


=Q,. (by the second structural equation) 


30.2 Properties of the Curvature Form 


Now that we have defined the curvature of a connection on a principal G-bundle 
1: P —> M, it is natural to study some of its properties. Like a connection form, 
the curvature form © is equivariant with respect to right translation on P and the 
adjoint representation on g. However, unlike a connection form, a curvature form 
is horizontal in the sense that it vanishes as long as one argument is vertical. In 
this respect it acts almost like the opposite of a connection form, which vanishes on 
horizontal vectors. 


Lemma 30.3. Let G be a Lie group with Lie algebra g and 1: P + M a principal 
G-bundle with a connection @. Fix a point p € P. 


(i) Every vertical vector X, € T,P can be extended to a fundamental vector field A 
on P for some A € g. 

(ii) Every horizontal vector Y, € T,P can be extended to the horizontal lift B ofaCc* 
vector field B on M. 


Proof. (i) By the surjectivity of j,,.: g + V, (Proposition 27.18) and Equation (27.3), 


Xp = Jp» (A) =A, 


for some A € g. Then the fundamental vector field A on P extends Xp. 


(ii) Let x = 2(p) in M and let By be the projection z,(Y,) € TM of the vector Y,. 
We can extend B, to a smooth vector field B on M. The horizontal lift B of B extends 
Y, on P. 


By Proposition 28.6, such a horizontal lift B is necessarily right-invariant. 


Theorem 30.4. Let G be a Lie group with Lie algebra g. Suppose ™: P+ M is a 
principal G-bundle, @ a connection on P, and Q the curvature form of . 


(i) (Horizontality) For p € P and Xp,¥, € TyP, 
On (Xps¥p) = (d@) y (BX, hY,). (30.1) 


(ii) (G-equivariance) For g € G, we have r,Q = (Adg!)Q. 


(iii) (Second Bianchi identity) dQ = [Q, a]. 
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Proof. (i) Since both sides of (30.1) are linear in X, and in Y,, we may decompose 
X, and Y, into vertical and horizontal components, and so it suffices to check the 
equation for vertical and horizontal vectors only. There are three cases. 


Case 1. Both X, and Y, are horizontal. Then 


1 
Qn (Xp, ¥p) = (d@) p (Xp, Vp) + 71: @p|(Xp,Yp) (definition of Q) 
- (d@)p(Xp,¥p) 


a ; ([@p(Xp). @p(Yp)| — [@p(Yp), op(Xp)]) 


= (d@)p(Xp,Yp) (@p(Xp) = 0) 
= (d@)p(hXp, hYp). (Xp, Yp horizontal) 


Case 2. One of X, and Y, is horizontal; the other is vertical. Without loss of gener- 
ality, we may assume X, vertical and Y, horizontal. Then [@p,@p|(Xp,Y>) = 0 as in 
Case 1. 

By Lemma 30.3 the vertical vector X, extends to a fundamental vector field A on 
P and the horizontal vector Y, extends to a right-invariant horizontal vector field B 
on P. By the global formula for the exterior derivative (Problem 21.8) 


dw(A,B) = A(a(B)) — B(@(A)) — o([A, 8). 


On the right-hand side, w(B) = 0 because B is horizontal, and Ba(A) = BA = 0 
because A is a constant function on P. Being the bracket of a fundamental and a hor- 
izontal vector field, [A, B] is horizontal by Lemma 28.7, and therefore w([A,B]) = 0. 
Hence, the left-hand side of (30.1) becomes 


Qy(Xp,¥p) = (d@) p(A,, Bp) =0. 
The right-hand side of (30.1) is also zero because hX, = 0. 


Case 3. Both X, and Y, are vertical. As in Case 2, we can write Xp = A, and 
Yp = B, for some A,B € g. We have thus extended the vertical vectors X, and Y, to 
fundamental vector fields X = A and Y = B on P. By the definition of curvature, 


Q(X, Y) = Q(A, B) 


= do(A,B) + = ([@(A), o(B)] — [w(B), @(A)]) 
= do(A,B) +[A,B). (30.2) 


In this sum the first term is 


(Problem 27.1) 
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Hence, (30.2) becomes 
Q(X, Y) = —A, B] + [A,B] =0. 


On the other hand, 
(d@),(hXp,hY,) = (d@),(0,0) = 0. 


(ii) Since the connection form @ is right-equivariant with respect to Ad, 


1 
rQ=r, (« o+ 5 [o, ol) (definition of curvature) 
=dr; Ler P: ition 21 
= dro + ° [7,@, 7,0] (Proposition 21.8) 


=d(Adg™)o+ 5 [(Adg a, (Adg)o] 


1 
= (Adg™') (ao + ;lo.01) 
= (Adg™!)Q. 
In this computation we used the fact that because Ad g~! = (oe ), is the differential 
of a Lie group homomorphism, it is a Lie algebra homomorphism. 


(iii) Taking the exterior derivative of the definition of the curvature form, we get 


1 
dQ = -d[a,o| 

2 

1 
= 7 ([do, @|—[@,do|) (Proposition 21.6) 
= [do, a] (Proposition 21.5) 

1 
= le — ,[0.0],0 (definition of Q) 
1 

= [Q, | _ zllo, o], | 
= [Q, a]. (Problem 21.5) 


In case P is the frame bundle Fr(£Z) of a rank r vector bundle £, with structure 
group GL(r,R), the second Bianchi identity becomes by Proposition 21.7 
dQ = [Q, 0] =QAQ-—@AQ, (30.3) 


where the connection and curvature forms @ and Q are gl(r,R)-valued forms on 
Fr(£). It should not be so surprising that it has the same form as the second Bianchi 
identity for the connection and curvature matrices relative to a frame e for E (Propo- 
sition 22.3). Indeed, by pulling back (30.3) by a frame e: U — Fr(E), we get 
dQ =e (QA a@)—e*(@AQ), 
de’ Q = (e&°Q) Ae aw — (ea) Ae*Q, 
dQ. = Qe. AN We — We N Qe, 


which is precisely Proposition 22.3. 
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Problems 


30.1. Curvature of the Maurer—Cartan connection 
Let G be a Lie group with Lie algebra g, and M a manifold. Compute the curvature of the 
Maurer—Cartan connection @ on the trivial bundle 7: Mx GM. 


30.2. Generalized second Bianchi identity on a frame bundle 

Suppose Fr(£) is the frame bundle of a rank r vector bundle E over M. Let @ be an Ehresmann 
connection and Q its curvature form on Fr(£). These are differential forms on Fr(Z) with 
values in the Lie algebra gl(r,R). Matrix multiplication and the Lie bracket on gl(r,R) lead 
to two ways to multiply gl(r,R)-valued forms (see Section 21.5). We write Q* to denote the 
wedge product of Q with itself k times. Prove that d(Q*) = [Q*, a]. 


30.3. Lie bracket of horizontal vector fields 
Let P — M be a principal bundle with a connection, and X,Y horizontal vector fields on P. 


(a) Prove that Q(X,Y) = —@([X,Y]). 
(b) Show that [X,Y] is horizontal if and only if the curvature Q(X, Y) equals zero. 
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$31 Covariant Derivative on a Principal Bundle 


Throughout this chapter, G will be a Lie group with Lie algebra g and V will be a 
finite-dimensional vector space. To a principal G-bundle 7: P — M and a represen- 
tation p: G—» GL(V), one can associate a vector bundle P xp V — M with fiber V. 
When p is the adjoint representation Ad of G on its Lie algebra g, the associated 
bundle P x aq g is called the adjoint bundle, denoted by Ad P. 

Differential forms on M with values in the associated bundle P x» V turn out to 
correspond in a one-to-one manner to certain V-valued forms on P called tensorial 
forms of type p. The curvature Q of a connection @ on the principal bundle P is 
a g-valued tensorial 2-form of type Ad on P. Under this correspondence it may be 
viewed as a 2-form on M with values in the adjoint bundle Ad P. 

Using a connection @, one can define a covariant derivative D of vector-valued 
forms on a principal bundle P. This covariant derivative maps tensorial forms to 
tensorial forms, and therefore induces a covariant derivative on forms on M with 
values in an associated bundle. In terms of the covariant derivative D, the curvature 
form is Q = Da, and Bianchi’s second identity becomes DQ = 0. 


31.1 The Associated Bundle 


Let x: P + M be a principal G-bundle and p: G > GL(V) a representation of G 
on a finite-dimensional vector space V. We write p(g)v as g-v or even gv. The 
associated bundle E := P Xp V is the quotient of P x V by the equivalence relation 


(p,v) ~ (pg,g7! -v) for g € Gand (p,v) € Px V. (31.1) 


We denote the equivalence class of (p,v) by [p,v]. The associated bundle comes with 
a natural projection B: P xp) V — M, B([p,v]) = 2(p). Because 


B([ps.g-'-v]) =2(pg) = 2(p) = B([p,¥)), 


the projection B is well defined. 
As a first example, the proposition below shows that an associated bundle of a 
trivial principal G-bundle is a trivial vector bundle. 


Proposition 31.1. Jf p : G— GL(V) is a finite-dimensional representation of a Lie 
group G, and U is any manifold, then there is a fiber-preserving diffeomorphism 


@: (UXG)xpVSUxYV, 
[(x, 8), ¥] nd (x,g°V). 


Proof. The map @ is well defined because if h is any element of G, then 


O([(x,g)h,no! -v)) = (x, (gh)-h7! -v) = (x,g-v) = @([(x,g),v])- 
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Define y: U x V > (U XG) Xp V by 


yx, v) _ [(@, 1),v}. 


It is easy to check that @ and y are inverse to each other, are C*, and commute with 
the projections. 


Since a principal bundle P — M is locally U x G, Proposition 31.1 shows that 
the associated bundle P xp V — M is locally trivial with fiber V. The vector space 
structure on V then makes P Xp V into a vector bundle over M: 


[P,V1] a [P, V2] = [P.V1 val 


Alp,v]=[p,Av], AER. (31.2) 


It is easy to show that these are well-defined operations not depending on the choice 
of p € E, and that this makes the associated bundle B: E — M into a vector bundle 
(Problem 31.2). 


Example 31.2. Let Ad: G—+ GL(g) be the adjoint representation of a Lie group G on 
its Lie algebra g. For a principal G-bundle 7: P — M, the associated vector bundle 
AdP := P XaqaqQg is called the adjoint bundle of P. 


31.2 The Fiber of the Associated Bundle 


If 7: P > M is a principal G-bundle, p: G — GL(V) is a representation, and E := 
P Xp V — M is the associated bundle, we denote by P, the fiber of P above x € M, 
and by E, the fiber of E above x € M. For each p € P,, there is a canonical way of 
identifying the fiber E, with the vector space V: 


Sp: VEX, 
v++ [p,v). 


Lemma 31.3. Let 7: P > M be a principal G-bundle, p: G + GL(V) a finite- 
dimensional representation, and E = P x pV the associated vector bundle. For each 
point p in the fiber P,, the map fy: V — E, is a linear isomorphism. 


Proof. Suppose [p,v] = [p,w]. Then (p,w) = (pg,g~!v) for some g € G. Since G 
acts freely on P, the equality p = pg implies that g = 1. Hence, w = g~!v = v. This 
proves that f, is injective. 

If [g,w] is any point in Z,, then g € P,, so g = pg for some g € G. It follows that 


[q;] = [pg,] = [p, gw] = fo(gw)- 


This proves that f, is surjective. 


The upshot is that every point p of the total space P of a principal bundle gives 
a linear isomorphism f,: V — Ez,p) from V to the fiber of the associated bundle E 
above 7(p). 
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Lemma 31.4. Let E = P XV be the vector bundle associated to the principal G- 
bundle P + M via the representation p: G—> GL(V), and f,: V — Ey the linear 
isomorphism v ++ [p,v]. If g € G, then fg = foo P(g). 


Proof. Forv eV, 
fog(v) = [ps.¥] = [p.8-¥] = fo(g-v) = fo(p(g)v)- 


Example 31.5. Let 7: P — M be a principal G-bundle. The vector bundle P xp 
V > M associated to the trivial representation p: G > GL(V) is the trivial bundle 
M x V — M, for there is a vector bundle isomorphism 


PxXpV—>Mx’V, 
[p.v] = [pg.g'-v] = [pg,v] > (x(p),v), 


with inverse map 
(x,v) 4 [p,v] for any p € 2! (x). 


In this case, for each p € P the linear isomorphism f,: V > Ey =V, v + [p,v], is 
the identity map. 


31.3 Tensorial Forms on a Principal Bundle 


We keep the same notation as in the previous section. Thus, 7: P — M is a principal 
G-bundle, p : G— GL(V) a finite-dimensional representation of G, and E := P Xp V 
the vector bundle associated to P via p. 


Definition 31.6. A V-valued k-form @ on P is said to be right-equivariant of type 
p or right-equivariant with respect to p if for every g € G, 
re =p(g')-9. 


What this means is that for p € P and yj,..., vg € TpP, 


(77) p(V1, +++ Ve) = p(g~')(@p(v1,..-,V%))- 


In the literature (for example, [12, p. 75]), such a form is said to be pseudo- 
tensorial of type p. 


Definition 31.7. A V-valued k-form @ on P is said to be horizontal if @ vanishes 
whenever one of its arguments is a vertical vector. Since a 0-form never takes an 
argument, every 0-form on P is by definition horizontal. 


Definition 31.8. A V-valued k-form on P is tensorial of type p if it is right- 
equivariant of type p and horizontal. The set of all smooth tensorial V-valued 
k-forms of type p is denoted by Of (P,V). 
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Example. Since the curvature Q of a connection @ on a principal G-bundle P is 
horizontal and right-equivariant of type Ad, it is tensorial of type Ad. 


The set OF (P,V) of tensorial k-forms of type p on P becomes a vector space 
with the usual addition and scalar multiplication of forms. These forms are of special 
interest because they can be viewed as forms on the base manifold M with values in 
the associated bundle E := P Xp V. To each tensorial V-valued k form @ € oF (P,V) 


we associate a k-form g’ € Q*(M,E) as follows. Given x € M and v1,...,v% € TM, 
choose any point p in the fiber P, and choose lifts u1,...,u, at p of v1,...,V%, Le., 
vectors in T;,P such that 7,(u;) = v;. Then g’ is defined by 


@2(V1,---, Ve) = fo (Pp(u1,--. Ue) € Ex, (31.3) 
where f,: V — E;, is the isomorphism v +> [p, v| of the preceding section. 
Conversely, if y € Q*(M,E), we define y* € co} (P,V) as follows. Given p € P 
and u1,...,Ug © TpP, let x = m(p) and set 
Wh (ur, --- sk) = Fy! (Wic(Matti,---, Matte) €V. (31.4) 
Theorem 31.9. The map 
k k 
Q,(P,V) > Q"(M,E), 
pg’, 
is a well-defined linear isomorphism with inverse y* Gy. 


Proof. To show that Q is well defined, we need to prove that the definition (31.3) is 
independent of the choice of p € P, and of u;,...,uz € TpP. Suppose u,,...,uj, € TyP 
is another set of vectors such that 7, (ui) = v;. Then 7,(uj — uj) = 0 so that ui — u; is 
vertical. Since @ is horizontal and k-linear, 


Pp (Uy, -.- Ue) = Pp(ui + vertical, ..., uz + vertical) 
= Pp Uy a05 lle) 
This proves that for a given p € P, the definition (31.3) is independent of the choice 
of lifts of v1,..., vg to p. 
Next suppose we choose pg instead of p as the point in the fiber P,. Because 


Norg=f, 
Tx (Teli) = (1 ° Ig) Ui = 1, Uj = Vi, 


so that rg.U},...,TgxUx are lifts of vj,...,vg to pg. We have, by right equivariance 
with respect to p, 


Ppg (Text --+ Texte) = (TePpg) (U1, -+ +, UK) 
=p(g-')@p(ur,-.-,#K)- 
So by Lemma 31.4, 
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Fra Pig Mott press rgxlUlk)) = fog (p (g-')@p(u1 yee Ux) 
= (fp ° p(g)) (P(g ')@p(ui,--- ue) 
a fo (Pp(ur,- oe 1Uk)) 
This proves that the definition (31.3) is independent of the choice of p in the fiber P,. 
Let y € Q*(M,E). It is clear from the definition (31.4) that y* is horizontal. It 
is easy to show that yi is right-equivariant with respect to p (Problem 31.4). Hence, 
wi € Ok (PV). 
For vj,...,v%¢ € TM, choose p € P, and vectors u},..., uz € TP that lift vy,..., vg. 
Then 


(yi?) (V1, -6+5 Ve) = fy(wi(u1,...,ue)) 
= fo (fn! (We(metts,.--57sttx)) ) 
= We(V1,--- 5 VE): 


Hence, y? =y. 
Similarly, g’! = ¢ for 9 € Of (P,V), which we leave to the reader to show (Prob- 
lem 31.5). Therefore, the map y +> yi is inverse to the map @ +> Q’. 


Example 31.10 (Curvature as a form on the base). By Theorem 31.9, the curvature 
form Q of a connection on a principal G-bundle P can be viewed as an element of 
Q?(M, Ad P), a 2-form on M with values in the adjoint bundle Ad P. 


When k = 0 in Theorem 31.9, Oe V) consists of maps f: P — V that are right- 
equivariant with respect to p: 


or 
f(ps) =p(s')f(p) =8'-f(p). 
On the right-hand side of Theorem 31.9, 


Q°(M,P x pV)= Q°(M,E) = sections of the associated bundle E. 
Hence, we have the following corollary. 


Corollary 31.11. Let G be a Lie group, P — M a principal G-bundle, and p: G—- 
Aut(V) a representation of G. There is a one-to-one correspondence 


G-equivariant maps sections of the associated bundle 
f[: PAV PxXpV—M . 


By the local triviality condition, for any principal bundle 7: P —> M the projec- 
tion map z is a submersion and therefore the pullback map 2*: Q*(M) > Q*(P) is 
an injection. A differential form @ on P is said to be basic if it is the pullback z* y 
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of a form y on M; it is G-invariant if r7@ = @ for all g ¢ G. More generally, for any 
vector space V, these concepts apply to V-valued forms as well. 

Suppose p: G— GL(V) is the trivial representation p(g) = 1 for all g € G. Then 
an equivariant form @ of type p on P satisfies 


re® =p(g'!)-o=0 for all g EG. 


Thus, an equivariant form of type p for the trivial representation p is exactly an 
invariant form on P. Unravelling Theorem 31.9 for a trivial representation will give 
the following theorem. 


Theorem 31.12. Let 7: P — M be a principal G-bundle and V a vector space. 
A V-valued form on P is basic if and only if it is horizontal and G-invariant. 


Proof. Let p: G— GL(V) be the trivial representation. As noted above, OF (PV) 
consists of horizontal, G-invariant V-valued k-forms on P. 
By Example 31.5, when p is the trivial representation, the vector bundle E = 
P Xp V is the product bundle M x V over M and for each p € P, the linear isomor- 
phism f,: V + E, =V, where x = 2(p), is the identity map. Then the isomorphism 
O*(M,E) = QF(M,M x V) = O*(M,V) > OF (PV), 
wy, 


is given by 


wi (ui... Uk) = We(Meu,..., Tete) (by (31.4) 
= (mW) p(u1,..-,Uk)- 
Therefore, 
we =a'y. 


This proves that horizontal, G-invariant forms on P are precisely the basic forms. 


31.4 Covariant Derivative 


Recall that the existence of a connection w on a principal G-bundle 7: P — M is 
equivalent to the decomposition of the tangent bundle TP into a direct sum of the 
vertical subbundle V and a smooth right-invariant horizontal subbundle H{. For any 
vector X, € T,P, we write 

Xp = VXp +hXp 


as the sum of its vertical and horizontal components. This will allow us to define 
a covariant derivative of vector-valued forms on P. By the isomorphism of Theo- 
rem 31.9, we obtain in turn a covariant derivative of forms on M with values in an 
associated bundle. 

Let p: G— GL(V) be a finite-dimensional representation of G and let E := 
P Xp V be the associated vector bundle. 
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Proposition 31.13. [fp € Q*(P,V) is right-equivariant of type p, then so is dg. 
Proof. For a fixed g € G, 


redg =drep =dp(g')p 
=p(g')d@, 


since p(g~!) is a constant linear map for a fixed g. 


In general, the exterior derivative does not preserve horizontality. For any V- 
valued k-form @ on P, we define its horizontal component ¢" € Q*(P,V) as follows: 
for p € Pand vyy,...,v, € T,P, 

Oh (Vis +5 VK) = ,(hv1,...,hvx). 
Proposition 31.14. /f g ¢ Q*(P,V) is right-equivariant of type p, then so is gp". 
Proof. For g € G, p€ P, and vj,...,v% € TpP, 


TO. NM Vigeog =O ovigs<itern) (definition of pullback) 


= Ope 

= Opglhrgxvi,-..,Argevg) (definition of o") 

= Ppe(Terhvi,...:TexhVe) (Proposition 28.4) 

= (re pg) (M1, .. +, AVg) 

g |) @p(hy,...,hv~)  (tight-equivariance of @) 


go!) ORV... VK) 


Propositions 31.13 and 31.14 together imply that if @ ¢ Q*(P,V) is right- 
equivariant of type p, then (dg)" € Q**!(P,V) is horizontal and right-equivariant 
of type p, i.e., tensorial of type p. 

Definition 31.15. Let 7: P—> M be a principal G-bundle with a connection @ 


and let V be a real vector space. The covariant derivative of a V-valued k-form 
g € OF (PV) is Do = (do)". 


Let p: G— GL(V) be a finite-dimensional representation of the Lie group G. 
The covariant derivative is defined for any V-valued k-form on P, and it maps a right- 
equivariant form of type p to a tensorial form of type p. In particular, it restricts to a 
map 


D: OF (P,V) + QE (BV) (31.5) 
on the space of tensorial forms. 


Proposition 31.16. Let 7: P — M be a principal G-bundle with a connection and 
p: G— GL(V) a representation of G. The covariant derivative 


D: OF (P,V) + O5*1(P,V) 


on tensorial forms of type p is an antiderivation of degree +1. 
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Proof. Let w,t € Q5(P,V) be tensorial forms of type p. Then 


D(w At) = (d(@A7))" 

= ((d@) At +(-1)*8°@ Adt)" 
= (do)" At* + (—1)88% qq" A (dt)!" 

= Dwi" + (—1)%8°@" A Dr. 


Since Tt and @ are horizontal, t’ = t and w" = @. Therefore, 


D(@ AT) = DoAt+ (—1)*8°@ A Dt. 


If E := P Xp V is the associated vector bundle via the representation p, then the 
isomorphism of Theorem 31.9 transforms the linear map (31.5) into a linear map 


D: Q*(M,E) > QF! (M,E). 


Unlike the exterior derivative, the covariant derivative depends on the choice of 
a connection on P. Moreover, D* ¢ 0 in general. 


Example 31.17 (Curvature of a principal bundle). By Theorem 30.4 the curvature 
form Q € 4 (P, g) on a principal bundle is the covariant derivative Da@ of the con- 
nection form @ € Q!(P,g). Because @ is not horizontal, it is not in Q4 j(P,g). 


31.5 A Formula for the Covariant Derivative of a Tensorial Form 


Let 7: P — M bea smooth principal G-bundle with a connection @, and let p: G— 
GL(V) be a finite-dimensional representation of G. In the preceding section we 
defined the covariant derivative of a V-valued k-form @ on P: Do = (dg)", the 
horizontal component of d@. In this section we derive a useful alternative formula 
for the covariant derivative, but only for a tensorial form. 

The Lie group representation p : G—» GL(V) induces a Lie algebra representa- 
tion p.: g > gl(V), which allows us to define a product of a g-valued k-form T and 
a V-valued ¢-form @ on P: for p € P and vy,...,ve40 € TP, 


(T-@)p(Vi,---s Vere) 
1 
= TA DY sgn(o) ps (tp(vo(1),-+Vo(k))) Pp (Voett) s+ Vo(k+e)) - 


* OESk+¢ 


For the same reason as the wedge product, T- @ is multilinear and alternating in 
its arguments; it is therefore a (k + ¢)-covector with values in V. 


Example 31.18. If V = g and p = Ad: G-> GL(g) is the adjoint representation, then 
(T-P)p=— by sgn(o) [tp (Vo(1)s+«+s¥o())s Op Va(kt1)9++ Vo(e+e))| : 


In this case we also write [T, @] instead of T- @. 
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Theorem 31.19. Let 7: P — M be a principal G-bundle with connection form @, 
and p: G—>+ GL(V) a finite-dimensional representation of G. If @ € OF (PV) isa 
V-valued tensorial form of type p, then its covariant derivative is given by 


Doe =dP+a-9@. 


Proof. Fix p € P and vj,...,Vx41 € TP. We need to show that 


(dp) p(hv1,... vezi) = (€)p(v1,---,¥e+1) 


Because both sides of (31.6) are linear in each argument v;, which may be de- 
composed into the sum of a vertical and a horizontal component, we may assume 
that each v; is either vertical or horizontal. By Lemma 30.3, throughout the proof we 
may further assume that the vectors v1,...,V%41 have been extended to vector fields 
X1,---,Xx¢41 on P each of which is either vertical or horizontal. If X; is vertical, then 
it is a fundamental vector field A; for some A; € g. If X; is horizontal, then it is the 
horizontal lift B; of a vector field B; on M. By construction, B; is right-invariant 
(Proposition 28.6). 

Instead of proving (31.6) at a point p, we will prove the equality of functions 


(d@) (AX,,...,AXi41) =1+0, (31.7) 


where 
I= (dp) (X1,. oe XK41) 


and 


1 
t= kt > sgn(O) px (@(Xo(1))) 9 (Xo(2); vee Xo(k+1)) : 


* OESky1 


Case 1. The vector fields X,,...,X,4 are all horizontal. 
Then II = 0 because @(X,,1)) = 0 for all o € S41. In this case, (31.7) is trivially 
true. 


Case 2. At least two of X1,...,Xk41 are vertical. 
By the skew-symmetry of the arguments, we may assume that X; = A; and X2 = A2 
are vertical. By Problem 27.1, [X1,X2] = [A1,A2] is also vertical. 

The left-hand side of (31.7) is zero because hX; = 0. By the global formula for 
the exterior derivative [21, Th. 20.14, p. 233], 


l= SX (-1) IX @(...,. Xi...) + »y (—1)'T/ p([X, Xj], --- Xin Xie) 
i=l 1<i<j<k+1 
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In this expression every term in the first sum is zero because @ is horizontal and at 
least one of its arguments is vertical. In the second sum at least one of the arguments 
of @ is Xj, X2, or [X1,X], all of which are vertical. Therefore, every term in the 
second sum in I is also zero. 

As for II in (31.7), in every term at least one of the arguments of @ is vertical, so 
Il=0. 


Case 3. The first vector field X; = A is vertical; the rest Xz,...,X,41 are horizontal 


and right-invariant. 
The left-hand side of (31.7) is clearly zero because hX, = 0. 
On the right-hand side, 


I= (de) (X1,..-,Xis1) 


=S(- 1)*'X:( (X),.. i> »»sXkq1) 
aD a 14 9((XXj, X},.. ce ight Snes Mea) 


Because ¢ is horizontal and X is vertical, the only nonzero term in the first sum is 
X19 (X2,.--,Xk+1) =AM(X,..- X41) 
and the only nonzero terms in the second sum are 
k+l a 
SY (= 1)! 4p ([X1, Xj], X1,X2,- + Xj Xp) 
j=2 
Since the X;, 7 =2,...,k-+1, are right-invariant horizontal vector fields, by Lemma 28.7, 


[X1,X;] = [A,X;] =0. 


Therefore, 
I =AQ(X, “ae Xk+1)- 


If o(i) = 1 for any i > 2, then 


@(Xo(2).+- + Xo(k41)) = 0. 


It follows that the nonzero terms in II all satisfy o(1) = 1 and 


1 
I=; > sgn(o)px(@(%1)) @ (Xo(2),---Xo(e+1)) 


“ OESk 41 
o(1)=1 


1 
Hi Y  sgn(o)p«(A)@ (Xo(2)1-++Xo(e+1)) 


* OESK+1 
o(1)=1 


= px (A)@ (Xo,...,Xk41) (because @ is alternating). 
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Denote by f the function @(X2,...,X,41) on P. For p € P, to calculate A,f, 
choose a curve c(t) in G with initial point c(0) = e and initial vector c'(0) = A, for 
example, c(t) = exp(tA). Then with j,: G— P being the map jp(g) = p-g, 


il)! 


Anf = jp»(A)f = jpe(e'(0)) f = Jos ( (4 
d 


‘ d 
=(jpoc)s (+ _)i- dt 
By the right-invariance of the horizontal vector fields X2,...,Xx+1, 
(f° jpoc)(t) = f(pel(t)) 
= Ppc(t) (X2,pe(e)> a Xk+1,po(t)) 


(fo Jpe c), 


t=0 


= Ppci(t) (Te(e)xX2,p; oe Fotis Xhit,p) 

= re(t) Ppe(t) (X2,p, tee Aetip) 

=p(c(t)|)o, (Xo py-+--Xet1p) (right-equivariance of @) 
=p(c(t)')f(p). 


Differentiating this expression with respect to ¢ and using the fact that the differential 
of the inverse is the negative [21, Problem 8.8(b)], we have 


Apf = (f° jp oc)’ (0) = —ps(c'(0)) f(p) = —ps(A) f(2)- 
So the right-hand side of (31.7) is 
I+ =Af+tp.(A)f = —ps(A)f + px(A)f = 0. 


If V is the Lie algebra g of a Lie group G and p is the adjoint representation of 
G, then - @ = [@, g]. In this case, for any tensorial k-form p € OK ,(P,g), 


Do =do+|o, 9). 


Although the covariant derivative is defined for any V-valued form on P, The- 
orem 31.19 is true only for tensorial forms. Since the connection form @ is not 
tensorial, Theorem 31.19 cannot be applied to w. In fact, by the definition of the 
curvature form, 


1 
Q=do+ 710, a]. 


By Theorem 30.4, Q = (dw)" = Da. Combining these two expressions for the 
curvature, one obtains 


1 
Do = do+ 710, o]. 


The factor of 1/2 shows that Theorem 31.19 is not true when applied to @. 
Since the curvature form Q on a principal bundle P is tensorial of type Ad, Theo- 
rem 31.19 applies and the second Bianchi identity (Theorem 30.4) may be restated as 


DQ = dQ. + [o,Q] =0. (31.8) 
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Problems 


Unless otherwise specified, in the following problems G is a Lie group with Lie algebra g, 
m: P — M a principal G-bundle, p: G + GL(V) a finite-dimensional representation of G, 
and E = P Xp V the associated bundle. 


31.1. Transition functions of an associated bundle 

Show that if {(Uo,, 6q)} is a trivialization for P with transition functions Sap: UaNUg > G, 
then there is a trivialization {(Uq, Wo.) } for E with transition functions p o ggg: Ug Ug > 
GL(V). 


31.2. Vector bundle structure on an associated bundle 
Show that the operations (31.2) on E = P Xp V are well defined and make the associated 
bundle B : E — M into a vector bundle. 


31.3. Associated bundle of a frame bundle 

Let E > M be a vector bundle of rank r and Fr(£) — M its frame bundle. Show that the 
vector bundle associated to Fr(£) via the identity representation p: GL(r,R) > GL(r,R) is 
isomorphic to E. 


31.4. Tensorial forms 
Prove that if y € Q*(M,P xp V), then y* € O*(P,V) is right-equivariant with respect to p. 


31.5. Tensorial forms 
For 9 € Of, (P,V), prove that 9”! = 9. 
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$32 Characteristic Classes of Principal Bundles 


To a real vector bundle E — M of rank r, one can associate its frame bundle 
Fr(E) > M, a principal GL(r,R)-bundle. Similarly, to a complex vector bundle of 
rank r, one can associate its frame bundle, a principal GL(7,C)-bundle and to an 
oriented real vector bundle of rank r, one can associate its oriented frame bundle, 
a principal GL*(r,R)-bundle, where GL*(r,R) is the group of all r x r matrices 
of positive determinant. The Pontrjagin classes of a real vector bundle, the Chern 
classes of a complex vector bundle, and the Euler class of an oriented real vector 
bundle may be viewed as characteristic classes of the associated principal G-bundle 
for G = GL(r,R),GL(7,C), and GL* (r,R), respectively. 

In this section we will generalize the construction of characteristic classes to 
principal G-bundles for any Lie group G. These are some of the most important 
diffeomorphism invariants of a principal bundle. 


32.1 Invariant Polynomials on a Lie Algebra 


Let V be a vector space of dimension n and VY its dual space. An element of 
Sym‘ (VY) is called a polynomial of degree k on V. Relative to a basis e1,...,€n 
for V and corresponding dual basis a'!,...,a” for VY, a function f: V > Risa 
polynomial of degree k if and only if it is expressible as a sum of monomials of 
degree kin a!,...,00": 


f= gota", (32.1) 


For example, if V = R”*” is the vector space of all n x n matrices, then trX is a 
polynomial of degree | on V and detX is a polynomial of degree n on V. 

Suppose now that g is the Lie algebra of a Lie group G. A polynomial f: g— R 
is said to be Ad(G)-invariant if for all g € G and X € g, 


f ((Adg)X) = f(X). 


For example, if G is the general linear group GL(n,R), then (Adg)X = gXg~! and 
trX and detX are AdG-invariant polynomials on the Lie algebra gl(n,R). 


32.2 The Chern—Weil Homomorphism 


Let G be a Lie group with Lie algebra g, P — M a principal G-bundle, @ an Ehres- 


mann connection on P, and Q the curvature form of @. Fix a basis e1,...,e, for g 
and dual basis a!,..., 0” for g’. Then the curvature form Q is a linear combination 
Q= Fe, 


where the coefficients Q! are real-valued 2-forms on P. If f: g—> R is the polynomial 
Yaa! ---a'k, we define f(Q) to be the 2k-form 
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f(Q) = YaQ! n--- AQ" 


on P. Although defined in terms of a basis for g, the 2k-form f(Q) is independent of 
the choice of a basis (Problem 32.2). 
Recall that the covariant derivative D@ of a k-form @ on a principal bundle P is 
given by 
(D@)p(“1, es Vk) = (d~)p(hy1,. ..,Avk), 


where v; € T,,P and hy; is the horizontal component of 1j. 


Lemma 32.1. Let x: P — M be a principal bundle. If ~ is a basic form on P, then 
do=D¢Q. 


Proof. A tangent vector X, € T,P decomposes into the sum of its vertical and hori- 
zontal components: 
Xp = VXp +hXp. 


Here h: T,P —> T,P is the map that takes a tangent vector to its horizontal compo- 
nent. Since 7,.X, = 1,hX, for all X, € T,P, we have 


Ts = My oh. 
Suppose @ = 2*T for tT € Q*(M). Then 


Do = (d@) oh (definition of D) 
=(dn*t)oh (@ is basic) 
=(m*dt) oh ({21, Prop. 19.5]) 
=dtonmoh (definition of 2*) 


=dTot, (T%, coh = Tx) 
=n'dt (definition of 2*) 
=dn*t ({21, Prop. 19.5]) 
=do (9 =7'T). 


The Chern—Weil homomorphism is based on the following theorem. As before, 
G is a Lie group with Lie algebra g. 


Theorem 32.2. Let Q be the curvature of a connection @ on a principal G-bundle 
m: P —> M, and f an Ad(G)-invariant polynomial of degree k on g. Then 


(i) f (Q) is a basic form on P, i.e., there exists a 2k-form A on M such that f(Q) = 
m*A. 
(ii) A is a closed form. 
(iii) The cohomology class [A] is independent of the connection. 


Proof. (i) Since the curvature Q is horizontal, so are its components Q' and there- 
fore so is f(Q) = Ya7Q" A--- AQ, 


(ii) 


(iii) 
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To check the G-invariance of f(Q), let g € G. Then 
i (f(Q)) a re ( Yaa" Aves AQ!) 
= aro )Av Ar (Q*). 
Since the curvature form Q is right-equivariant, 


r,Q = (Adg')Q 


r3(Y Mei) = ((Adg!)Q)'ei, 
so that 
73 (Q') = ((Adg7')Q)'. 
Thus, 


r*(f(Q)) = Yiar((Adg!)Q)" A.A ((Adg~!)Q)"* 


= f((Adg”')Q) 
= f(Q) (by the AdG-invariance of f). 


Since f(Q) is horizontal and G-invariant, by Theorem 31.12, it is basic. 
Since 71,.: TpP —> Ty(p)M is surjective, 1* : Q*(M) — Q*(P) is injective. There- 
fore, to show that dA = 0, it suffices to show that 


n*dA = dn*A =df(Q) =0. 
If f= Yao... alk, then 
F(Q) = YiaQ! A-- AQ". 


In this expression, each a; is a constant and therefore by Lemma 32.1 
Da; = day = 0. 


By the second Bianchi identity (31.8), DQ = 0. Therefore, DQ! = 0 for each i. 
Since the Q are right-equivariant of type Ad and horizontal, they are tensorial 
forms. By Lemma 32.1 and because D is an antiderivation on tensorial forms 
(Proposition 31.16) 


d(f(Q)) = D(f(Q)) = D(a" n--- Aa") 
= VY qQ" Aw A DQ Ao A Q2K 
Ij 
= 0. 
Let J be an open interval containing the closed interval [0,1]. Then P x J is a 


principal G-bundle over M x I. Denote by p the projection P x I — P to the first 
factor. If @p and @; are two connections on P, then 


@ = (1—t)p* @+tp* a; (32.2) 
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is a connection on P x J (Check the details). Moreover, if i;: P + P x I is the 
inclusion p +> (p,t), then i}@ = @p and 7} @ = @. 
Let 


e 1 
Q=da+ 5 [, | 
be the curvature of the connection @. It pulls back under ig to 


e 1 
HO = dkO+ i610, 


2, 
ee 
= d@ + 7 i010] 
1 
= da + |, @o| 
= Qo, 


the curvature of the connection @p. Similarly, FQ = Q,, the curvature of the 
connection @. 
For any Ad(G)-invariant polynomial 


f= Yaa we yik 
of degree k on g, 
inf (Q) = ig Ya7Q" A A QK 
=YaQi A. AQi 
= f(Qo) 
and 


if (Q) = f(Q1). 
Note that ig and ij: P — P x I are homotopic through the homotopy i;. By the 
homotopy axiom of de Rham cohomology, the cohomology classes [ij f(Q)] 
and [it f (Q)] are equal. Thus, [f(Qo)] = [f(Q1)], or 

*|Ao] = 1* [Aq]. 
By the injectivity of z*, [Ag] = [Aj], so the cohomology class of A is indepen- 
dent of the connection. 


Let 2: P + M be a principal G-bundle with curvature form Q. To every Ad(G)- 


invariant polynomial on g, one can associate the cohomology class [A] € H*(M) such 
that f(Q) = 2*A. The cohomology class [A] is called the characteristic class of P 
associated to f. Denote by Inv(g) the algebra of all Ad(G)-invariant polynomials 
on g. The map 


w: Inv(g) > H*(M) 
f > [A], where f(Q) =7*A, (32.3) 


that maps each Ad(G)-invariant polynomial to its characteristic class is called the 
Chern—Weil homomorphism. 
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Example 32.3. If the Lie group G is GL(r,C), then by Theorem B.10 the ring of 
Ad(G)-invariant polynomials on gl(r,C) is generated by the coefficients f,(X) of 
the characteristic polynomial 


det(AI+X) = Lie ay tama 


The characteristic classes associated to fi(X),..., f,(X) are the Chern classes of a 
principal GL(r,C)-bundle. These Chern classes generalize the Chern classes of the 
frame bundle Fr(£) of a complex vector bundle E of rank r. 


Example 32.4. If the Lie group G is GL(r,R), then by Theorem B.13 the ring of 
Ad(G)-invariant polynomials on gl(7,R) is also generated by the coefficients f;,(X) 
of the characteristic polynomial 


det(AI+X) = Lie me 


The characteristic classes associated to f|(X),...,f;(X) generalize the Pontrjagin 
classes of the frame bundle Fr(£) of a real vector bundle E of rank r. (For a real 
frame bundle the coefficients f,(Q) vanish for k odd.) 


Problems 

32.1. Polynomials on a vector space 

Let V be a vector space with bases e),...,é, and u1,...,U,. Prove that if a function 
f: V — Ris a polynomial of degree k with respect to the basis e),...,e,, then it is 
a polynomial of degree k with respect to the basis u1,...,u,. Thus, the notion of a 


polynomial of degree k on a vector space V is independent of the choice of a basis. 


32.2. Chern—Weil forms 
In this problem we keep the notations of this section. Let e),...,@, and u1,...uy, be 
two bases for the Lie algebra g with dual bases a!,...,@” and B!,...,B”, respec- 


tively. Suppose 
Q = ¥ Qe; = Yeu; 


f= Yaa Qk = > 5/8" 1 Ble, 


and 


Prove that 
Yig7Q! A AQH = ¥ bP A. ABE, 
This shows that the definition of f(Q) is independent of the choice of basis for g. 


32.3. Connection on P x [ 
Show that the 1-form @ in (32.2) is a connection on P x J. 


32.4. Chern—Weil homomorphism 
Show that the map w: Inv(g) — H*(M) in (32.3) is an algebra homomorphism. 


Appendices 


$A Manifolds 


This appendix is a review, mostly without proofs, of the basic notions in the theory 
of manifolds and differential forms. For more details, see [21]. 


A.1 Manifolds and Smooth Maps 


We will be following the convention of classical differential geometry in which vec- 
tor fields take on subscripts, differential forms take on superscripts, and coefficient 
functions can have either superscripts or subscripts depending on whether they are 
coefficient functions of vector fields or of differential forms. See [21, 84.7, p. 44] for 
a more detailed explanation of this convention. 

A manifold is a higher-dimensional analogue of a smooth curve or surface. Its 
prototype is the Euclidean space R”, with coordinates r!,...,r”. Let U be an open 
subset of R”. A function f = (f!,...,f’"): U > R” is smooth on U if the partial 
derivatives O*f /dr/! ---drJk exist on U for all integers k > 1 and all j,,..., jg. In this 
book we use the terms “smooth” and “C”” interchangeably. 

A topological space M is locally Euclidean of dimension n if for every point p in 
M, there is ahomeomorphism @ of a neighborhood U of p with an open subset of R”. 
Such a pair (U,@: U — R") is called a coordinate chart or simply a chart. If p € U, 
then we say that (U,@) is a chart about p. A collection of charts {(Ua,¢a: Ua 
IR")} is C* compatible if for every o and B, the transition function 


bao dg': op(UaNUg) + ba(UaN Ug) 


is C*. A collection of C* compatible charts {(Ua, Qo: Ua — R")} that cover M is 
called a C* atlas. A C™ atlas is said to be maximal if it contains every chart that is 
C* compatible with all the charts in the atlas. 
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Definition A.1. A topological manifold is a Hausdorff, second countable, locally 
Euclidean topological space. By “second countable,” we mean that the space has 
a countable basis of open sets. A smooth or C~ manifold is a pair consisting of a 
topological manifold M and a maximal C™ atlas {(Uq,@qa)} on M. In this book all 
manifolds will be smooth manifolds. 


In the definition of a manifold, the Hausdorff condition excludes certain patho- 
logical examples, while the second countability condition guarantees the existence 
of a partition of unity, a useful technical tool that we will define shortly. 

In practice, to show that a Hausdorff, second countable topological space is a 
smooth manifold it suffices to exhibit a C™ atlas, for by Zorn’s lemma every C™ atlas 
is contained in a unique maximal atlas. 


Example A.2. Let S! be the circle defined by x* +” = | in R?, with open sets (see 
Figure A.1) 


U {ayes [eeu 
Uz ={(x,y) € S" |x < 0}, 
Uy = {(x,y) €S' | y > 0}, 
Uy = {(x,y) €S' | y < O}. 
ut 
gs uy 
s} U> 


y 


Fig. A.1. A C® atlas on S!. 


Then {(U,',y), (Ux ,y), (Uy ,x), (Uy ,x)} is a C® atlas on S'. For example, the tran- 
sition function from 


the open interval ]0,1[ =x(U NU, ) > y(Us NU, ) =]-1,0[ 
is y= —V1—2, which is C® on its domain. 


A function f: M — R” ona manifold M is said to be smooth or C™ at p € M if 
there is a chart (U,@) about p in the maximal atlas of M such that the function 


food ':R">0(U) > R" 
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is C*. The function f: M — R” is said to be smooth or C™ on M if it is C® at every 
point of M. Recall that an algebra over R is a vector space together with a bilinear 
map : Ax A —- A, called multiplication, such that under addition and multiplica- 
tion, A becomes a ring. Under addition, multiplication, and scalar multiplication, the 
set of all C* functions f: M — R is an algebra over R, denoted by C*(M). 

A map F': N + M between two manifolds is smooth or C™ at p € N if there is a 
chart (U,@) about p in N and a chart (V, yw) about F(p) in M with V D> F(U) such 
that the composite map yo Fog ~!: R"D o(U) 4 w(V) CR” is C® at o(p). It is 
smooth on N if it is smooth at every point of VN. A smooth map F': N + M is called 
a diffeomorphism if it has a smooth inverse, i.e., a smooth map G: M — N such that 
FoG=I1y andGoF=I1y. 

A typical matrix in linear algebra is usually an m x n matrix, with m rows and 
n columns. Such a matrix represents a linear transformation F': R” — R”. For this 
reason, we usually write a C* map as F: N — M, rather than F: M — N. 


A.2 Tangent Vectors 


The derivatives of a function f at a point p in R” depend only on the values of f in 
a small neighborhood of p. To make precise what is meant by a “small” neighbor- 
hood, we introduce the concept of the germ of a function. Decree two C™ functions 
f: U — Rand g: V — R defined on neighborhoods U and V of p to be equivalent 
if there is a neighborhood W of p contained in both U and V such that f agrees with 
g on W. The equivalence class of f: U — R is called the germ of f at p. 

It is easy to verify that addition, multiplication, and scalar multiplication are 
well-defined operations on the set C;" (M) of germs of C® real-valued functions at p 
in M. These three operations make C7 (M) into an algebra over R. 


Definition A.3. A point-derivation at a point p of a manifold M is a linear map 
D: C;(M) — R such that for any f, g € C?(M), 


D(fg) = (Df)a(p) + f(p)De. 


A point-derivation at p is also called a tangent vector at p. The set of all tangent 
vectors at p is a vector space T;,,M, called the tangent space of M at p. 


Example A.4. If r!,...,r" are the standard coordinates on R” and p € R”, then the 
usual partial derivatives 


a 
or! 


7) 


ae ate ee 
6 or 


P 
are tangent vectors at p that form a basis for the tangent space T,(IR"). 


Ata point p in a coordinate chart (U,@) = (U,x!,...,x"), where x! = r' o @ is the 
ith component of $, we define the coordinate vectors 0 /dx'|, € TpM by 


O o 
ae | = or 


foo? for each f < C;(M). 
o(p) 
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If F: N— M isaC® map, then at each point p € N its differential 
Fp: TpN — Tr(p)M, (A.1) 
is the linear map defined by 
(F..pXp)(t) = Xp(ho F) 


for X, € T,N and he Crip) (M). Usually the point p is clear from context and we 
may write F;, instead of F;,,. It is easy to verify that if fF: N— M and G: M— P 
are C* maps, then for any p € N, 


(G oF )xp = G, F(p) ° Fy p, 


or, suppressing the points, 
(Go F).. =G, ofF,. 


A.3 Vector Fields 


A vector field X on a manifold M is the assignment of a tangent vector X, € T,M to 
each point p € M. At every p ina chart (U,x!,...,x"), since the coordinate vectors 
0/dx'|, form a basis of the tangent space T,M, the vector X, can be written as a 
linear combination 


ete: en 
X= da (P) oe with a'(p) ER. 


p 


As p varies over U, the coefficients a'(p) become functions on U. The vector field X 
is said to be smooth or C™ if M has a C™ atlas such that on each chart (a's seh ih) 
of the atlas, the coefficient functions a’ in X = Ya'd/dx! are C”. We denote the set 
of all C® vector fields on M by X(M). It is a vector space under the addition of vector 
fields and scalar multiplication by real numbers. As a matter of notation, we write 
tangent vectors at p as X,,Y,,Z, € T,M, or if the point p is understood from context, 
aS V1,V2,...,V% © T)M. As a shorthand, we sometimes write 0; for 0/0x’. 

A frame of vector fields on an open set U C M is a collection of vector fields 
X1,...,Xn on U such that at each point p € U, the vectors (X1)p,...,(Xn)p form a 
basis for the tangent space T,M. For example, in a coordinate chart (U i evanyle Ns 
the coordinate vector fields 0/dx',...,0/0x" form a frame of vector fields on U. 

A C® vector field X on a manifold M gives rise to a linear operator on the vector 
space C*(M) of C® functions on M by the rule 


(Xf)(p)=Xpf for f €C°(M) and pe M. 


To show that Xf is a C” function on M, it suffices to write X in terms of local 
coordinates x!,...,x ina neighborhood of p, say X = Yad /dx'. Since X is assumed 
C®, all the coefficients a’ are C*. Therefore, if f is C*, then Xf = Yad f/dx! is 
also. 
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The Lie bracket of two vector fields X,Y € X(M) is the vector field [X,Y] de- 
fined by 


IX,Y]pf =Xp¥f)—¥,(Xf) for pe M and f € Cy (M). (A.2) 


So defined, [X,Y], is a point-derivation at p (Problem A.3(a)) and therefore [X,Y] 
is indeed a vector field on M. The formula for [X,Y] in local coordinates (Prob- 
lem A.3(b)) shows that if X and Y are C®, then so is [X,Y]. 

If f: N+ M isaC™ map, its differential f.»: Tp>N — Ty(p)M pushes forward a 
tangent vector at a point in N to a tangent vector in M. It should be noted, however, 
that in general there is no push-forward map f,: X(N) — X%(M) for vector fields. For 
example, when f is not one-to-one, say f(p) = f(q) for p #q in N, it may happen 
that for some X € X(N), fe. pXp A fxgXq; in this case, there is no way to define f,.X 
so that (f:X) ¢(p) = fx,pXp for all p € N. Similarly, if f: N — M is not onto, then 
there is no natural way to define f,.X at a point of M not in the image of f. Of course, 
if f: N > M is a diffeomorphism, then the pushforward f,.: %(N) > X(M) is well 
defined. 


A.4 Differential Forms 


For k > 1, ak-form or a form of degree k on M is the assignment to each p in M of 
an alternating k-linear function 


@y: TypM x---XT,M > R. 
——_—_—"/ 


k copies 
Here “alternating” means that for every permutation o of the set {1,2,...,k} and 
Vigeee VEE T,M, 
Op (Vo(1)1-++»%o(k)) = (SENG) @p(V1,--- VE), (A.3) 


where sgno, the sign of the permutation o, is +1 depending on whether o is even 
or odd. We define a 0-form to be the assignment of a real number to each p € M; in 
other words, a 0-form on M is simply a real-valued function on M. When k = 1, the 
condition of being alternating is vacuous. Thus, a 1-form on M is the assignment of a 
linear function @,: T,M — R to each p in M. For k < 0, a k-form is 0 by definition. 

An alternating k-linear function on a vector space V is also called a k-covector 
on V. As above, a 0-covector is a constant and a 1-covector on V is a linear function 
f: VR. Let Ax (V) be the vector space of all k-covectors on V. Then Ag(V) =R 
and A;(V) = VY := Hom(V,R), the dual vector space of V. In this language a k-form 
on M is the assignment of a k-covector @, € Ax(T,M) to each point p in M. 

Let S, be the group of all permutations of {1,2,...,k}. A (k,2)-shuffle is a 
permutation o € S;4¢ such that 


o(1) <---<o(k) and o(k+1) <---<o(k+2). 
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The wedge product of a k-covector @ and an ¢-covector B on a vector space V is by 
definition the (k + @)-linear function 


(A B)(v1,-.-, Vere) = ¥(sgno)o(Vo(1),---,Vo(e))B(Vo(et1)s-++Vo(k+e)), (A-4) 


where the sum runs over all (x, 2)-shuffles. For example, if @ and f are 1-covectors, 
then 


(@ AB) (v1, v2) = &(v1)B(v2) — &(v2)B (v1). 


The wedge of a 0-covector, i.e., a constant c, with another covector @ is simply 
scalar multiplication. In this case, in keeping with the traditional notation for scalar 
multiplication, we often replace the wedge by a dot or even by nothing: c\ @ = 
c-O=co. 

The wedge product a / B is a (k+ £)-covector; moreover, the wedge operation 
A is bilinear, associative, and anticommutative in its two arguments. Anticommuta- 
tivity means that 


aAB = C1 1)48=de8B g AQ. 


Proposition A.5. /f a',...,a@" is a basis for the 1-covectors on a vector space V, 
then a basis for the k-covectors on V is the set 


fain. -Aa@k|1 <i ce <ig <n}. 


A k-tuple of integers J = (i1,..., ix) is called a multi-index. If i, <--- < ig, we 
call J an ascending multi-index, and if ij < --- < ix, we call J a strictly ascending 
multi-index. To simplify the notation, we will write a! = a! A---A alk. 

As noted earlier, for a point p in a coordinate chart (U ,x!,...,x”), a basis for the 
tangent space 7,,M is 


7] 
ax! 


ra) 
aii: A 
i Ox 


Let (dx'),,...,(dx"), be the dual basis for the cotangent space A\(T,M) = iM, 


1.€., 
i 0 i 


By Proposition A.5, if @ is a k-form on M, then at each p € U, @, is a linear combi- 
nation: 


Op = Yar(p) )(dx') p= Larl) )(dx!!)p A+++ A (dx**) p. 


We say that the k-form @ is smooth if M has an atlas {(U,x!,...,”)} such that on 
each U, the coefficients ay: U — R of @ are smooth. By differential k-forms, we 
will mean smooth k-forms on a manifold. 

A frame of differential k-forms on an open set U C M is a collection of dif- 
ferential k-forms @),...,@, on U such that at each point p € U, the k-covectors 
(@1)p,---,(@-)p form a basis for the vector space Ay (T,M) of k-covectors on the 
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tangent space at p. For example, on a coordinate chart (U,x!,...,x”), the k-forms 
dx! = dx! AN.» Adx't, 1 <i; <-+++ <ig <n, constitute a frame of differential k-forms 
on U. 

A subset B of a left R-module V is called a basis if every element of V can be 
written uniquely as a finite linear combination >'7;b;, where r; € R and b; € B. An 
R-module with a basis is said to be free, and if the basis is finite with n elements, then 
the free R-module is said to be of rank n. It can be shown that if a free R-module has 
a finite basis, then any two bases have the same number of elements, so that the rank 
is well defined. We denote the rank of a free R-module V by rkV. 

Let Q*(M) denote the vector space of C” k-forms on M and let 


Q*(M) = Dak). 
k=0 


If (U,x!,...,x") is a coordinate chart on M, then Q*(U) is a free module over C*(U) 
of rank i) , with basis dx! as above. 

An algebra A is said to be graded if it can be written as a direct sum A = @y_p Ax 
of vector spaces such that under multiplication, A, - Av C Ag+e. The wedge product 
A makes Q*(M) into an anticommutative graded algebra over R. 


A.5 Exterior Differentiation on a Manifold 


An exterior derivative on a manifold M is a linear operator d: Q*(M) > Q*(M), 
satisfying the following three properties: 


(1) d is an antiderivation of degree 1, i.e., d increases the degree by | and for @ € 
Q*(M) and t € Q'(M), 


d(@At) =daAt+(—1)'w dt; 


(2) d#@=dod=0; 
(3) ona0-form f €C*(M), 


(df) p(Xp) =Xpf for pe M and X, € T,M. 


By induction the antiderivation property (1) extends to more than two factors; for 
example, 


d(@ATAN) =d@AtAN+(—1)*2°@AdtAn + (1)? wA TAdnN. 


The existence and uniqueness of an exterior derivative on a general manifold is 
established in [21, Section 19]. To develop some facility with this operator, we will 
examine the case when M is covered by a single coordinate chart (U,x!,...,x”). This 
case can be used to define and compute locally on a manifold. 


Proposition A.6. Let (U,x!,...,x") be a coordinate chart. Suppose d: Q*(U) + 
Q*(U) is an exterior derivative. Then 
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(i) for any f € Q°(U), 


ap=y oF 


(ii) d(dx') = 0; 
(iii) for any aydx! € Qk (M), d(aydx!) = da; A dx! 


Proof. (i) Since (dx!),,...,(dx")» is a basis of 1-covectors at each point p € U, 
there are constants a;(p) such that 


(4f)p = Lair) (dx 
Suppressing p, we may write 
df =>, ajdx. 
Applying both sides to the vector field 0 /0x! gives 


ar(35)= Daves’ (5, +)= Laid} =ay 


On the other hand, by property (3) of d, 


ar( 35) = 219). 


Hence, aj = 0 f/dx!/ and df =>(0f/dx!) dx. 
(ii) By the antiderivation property of d, 
d(dx') = d(dx" A---Adx'*) = ¥(-1)/!dx" A---Addx'i A---Adx'* 
7 
=0 since d? = 0. 
(iii) By the antiderivation property of d, 


d (a;dx') = da; \dx' +.a;d(dx') 
= da, \dx! since d(dx') = 


Proposition A.6 proves the uniqueness of exterior differentiation on a coordinate 
chart (U,x!,...,x"). To prove its existence, we define d by two of the formulas of 
Proposition A.6: 

(i) if f € Q°(U), then df = Y(df/dx') dx’; 
(iii) if © = Yaydx! € Q*(U) for k > 1, then dw = Y.da; A dx'. 

Next we check that so defined, d satisfies the three properties of exterior differ- 
entiation. 


(1) For @ € QK(U) and t € Q(U), 
d(@ At) = (dw) \t+(—1)k@ dt. (A.5) 
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Proof. Suppose @ = Y.a;dx!' and t = Yb, dx’. On functions, d(fg) = (df)g + 
f (dg) is simply another manifestation of the ordinary product rule, since 


po E agree 
=E (a0 S5e) 
=(E5ha “a 6D Sav 


= (df)g+ fdg. 


Next suppose k > 1. Since d is linear and / is bilinear over R, we may assume 
that @ = a;dx! and t = b; dx’, each consisting of a single term. Then 


d(@ AT) = d(a;b; dx' \ dx’) 
= d(ayb;) \dx! A dx’ (definition of d) 
= (day)by \dx! \ dx! +a; db; \dx' Ndx! 
(by the degree 0 case) 
= da; \dx! \ by dx! + (—1)*aydx! Adby; \dx! 
=dw\t+(—-1)'wAdt. 


(2) d* =0 on QK(U). 


Proof. This is a consequence of the fact that the mixed partials of a function are 
equal. For f € Q°(U), 


non ay 
2 = J 
df= a(S 364 ) -33 36 -dx! \ dx’. 


In this double sum, the factors 07 f/0x/0x' are symmetric in i, j, while dx/ \ dx! are 
skew-symmetric in i, 7. Hence, for each pair i < j there are two terms 


ee 7ax' Ndx!, ce dx! dx! 
that add up to zero. It follows that d*,f = 0. 
For @ = Ya;dx! € QF(U), where k > 1, 
d’o=d (Sida; \ dx') (by the definition of da) 
= ¥\(d*a;) \ dx! + da; \d(dx') 


=0. 


In this computation, d*a7 = 0 by the degree 0 case, and d(dx!) = 0 follows as in the 
proof of Proposition A.6(ii) by the antiderivation property and the degree 0 case. 
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(3) For f aC® function and X a C® vector field on (U,x!,...,x"), (df)(X) =Xf. 


Proof. Suppose X = Xa/0/dx/. Then 
r 2 y a! ce =a 
sad oxi ie 


A.6 Exterior Differentiation on R? 


On R? with coordinates x,y,z, every smooth vector field X is uniquely a linear com- 
bination 


0 7] 0 
tae lay oe ye 


with coefficient functions a,b,c € C*(R*). Thus, the vector space X(IR*) of smooth 
vector fields on R? is a free module of rank 3 over C*(R*) with basis {0 /dx,0/dy, 
0/dz}. Similarly, Q3 (IR) is a free module of rank 1 over C*(IR*) with basis {dx \ 
dy \ dz}, while Q! (IR?) and Q?(IR3) are free modules of rank 3 over C*(R*) with 
bases {dx,dy,dz} and {dy \ dz, dz dx, dx \ dy}, respectively. So the following 
identifications are possible: 


functions = 0-forms <¢<—> 3-forms 
f = f <> fdxAdyAdz 


and 


vector fields << 1-forms o 2-forms 
X = (a,b,c) ~ adx+bdy+cdz & adyNdz+bdz\dx+cdx/Ady. 


We will write f. = 0 f/dx, fy =Of/dy, and f, =O f/dz. On functions, 
df = frdx+ fydy + f, dz. 
On 1-forms, 
d(adx+bdy+cdz) = (cy —b,) dy \dz— (cx — az) dz dx + (by — ay) dx A dy. 
On 2-forms, 
d(ady \dz+bdz\dx+cdx/ dy) = (ax+by+c,)dxAdy/ dz. 


Identifying forms with vector fields and functions, we have the following correspon- 
dences: 


d(0-form) — gradient of a function, 
d(1-form) > curl of a vector field, 


d(2-form) > divergence of a vector field. 
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A.7 Pullback of Differential Forms 


Unlike vector fields, which in general cannot be pushed forward under a C” map, 
differential forms can always be pulled back. Let F: N — M be aC” map. The 
pullback of a C® function f on M is the C* function F* f := f oF on N. This 
defines the pullback on C® O-forms. For k > 0, the pullback of a k-form @ on M is 
the k-form F*@ on N defined by 


(F*@)p(v1,-- . Vk) = OF (p) (Fx,pV1, ee Fy pk) 


for p € N and y,...,vg € T,M. From this definition, it is not obvious that the pull- 
back F* @ of aC” form @ is C~. To show this, we first derive a few basic properties 
of the pullback. 


Proposition A.7. Let F: N — M be a C® map of manifolds. If @ and T are k-forms 
and o is an ¢-form on M, then 

(i) F*(@+T) = F*o+F*7; 

(ii) for any real number a, F*(a@) = aF*a; 
(iii) F*(@ Ao) = F* a) F*o; 

(iv) for any C® function h on M, dF*h = F*dh. 
Proof. The first three properties (i), (ii), (iii) follow directly from the definitions. To 
prove (iv), let p € N and X, € T,N. Then 


(dF*h) p(Xp) =Xp(F*h) (property (3) of d) 
= X)(hoF) (definition of F*h) 
and 
(F"dh) p(Xp) = (dh) r(p)(Fx,pXp) (definition of F*) 
= (Fy pXp)h (property (3) of d) 
=X,(hoF). (definition of F;. p) 
Hence, 
dF*h=F*dh. 


We now prove that the pullback of a C* form is C”. On a coordinate chart 
(U ae ...,+”") in M, a C® k-form @ can be written as a linear combination 


oO= Yay dx"! A+ Adx't, 
where the coefficients a; are C” functions on U. By the preceding proposition, 
F*w = ¥\(F*az) d(F*x!!) A---Ad(F*x'*) 
= Vi (ayo F) d(x! oF) A+++ Ad(x'* oF), 


which shows that F*@ is C™, because it is a sum of products of C~ functions and C~ 
1-forms. 
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Proposition A.8. Suppose F : N — M is a smooth map. On C® k-forms, dF* = F*d. 


Proof. Let @ € Q*(M) and p € M. Choose a chart (U,x!,...,x") about p in M. 
On U, 
o= Ya; dx"! AeA dxtk. 


As computed above, 
F*o = Vi(ay 0 F)d(x' oF) A+++ A d(x oF). 
Hence, 
dF*@ = ¥ d(ay0 F) A d(x"! 0 F) A+++ Ad(x'* 0 F) 
= \d(F*a;) \d(F*x") \---Ad(F*x'*) 
=) F*da; A F*dx'! \---\F*dx'# 
(dF* = F*d on functions by Prop. A.7(iv)) 
=) F* (da; A dx"! A+++ Adx't) 
(F* preserves the wedge product by Prop. A.7(iii)) 
= F*da. 


In summary, for any C® map F' : N > M, the pullback map F* : Q*(M) > Q*(N) 
is an algebra homomorphism that commutes with the exterior derivative d. 


Example A.9 (Pullback under the inclusion map of an immersed submanifold). Let 
N and M be manifolds. A C” map f: N — M is called an immersion if for all 
peéN, the differential f..,: T,N — Ty(p)M is injective. A subset S of M with a 
manifold structure such that the inclusion map i: S <> M is an immersion is called 
an immersed submanifold of M. An example is the image of a line with irrational 
slope in the torus R*/Z?. An immersed submanifold need not have the subspace 
topology. 

Ifo € Qk (M), p € S, and v,...,v% € TS, then by the definition of the pullback, 


(EO) Visesis¥e) _ Wj(p) (ix V1, +++) ixVic) — @p(V1,-+-,Vk)- 


Thus, the pullback of @ under the inclusion map i: S <> M is simply the restriction 
of @ to the submanifold S. We also adopt the more suggestive notation @|s for i*@. 


Problems 


A.1. Connected components 


(a) The connected component of a point p in a topological space S is the largest connected 
subset of S containing p. Show that the connected components of a manifold are open. 

(b) Let Q be the set of rational numbers considered as a subspace of the real line R. Show 
that the connected component of p € Q is the singleton set {p}, which is not open in Q. 
Which condition in the definition of a manifold does Q violate? 
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A.2. Path-connectedness versus connectedness 

A topological space S is said to be locally path-connected at a point p € S if for every neigh- 
borhood U of p, there is a path-connected neighborhood V of p such that V C U. The space S 
is locally path-connected if it is locally path-connected at every point p € S. A path compo- 
nent of S is a maximal path-connected subset of S. 


(a) Prove that in a locally path-connected space S$, every path component is open. 
(b) Prove that a locally path-connected space is path-connected if and only if it is connected. 


A.3. The Lie bracket 
Let X and Y be C™ vector fields on a manifold M, and p a point in M. 


(a) Define [X,Y], by (A.2). Show that for f,g € Cp (M), 


X,Y] p(fa) = (XY pf)g(p) + fp) (X,Y |g). 


Thus, [X,Y], is a tangent vector at p. 
(b) Suppose X = Ya'd; and Y = yb; in a coordinate neighborhood (U,x!, ...,x") of p in 
M. Prove that 
IX, Y] = ¥(a/a;b' = b/ dja!) 0}. 


ij 
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$B Invariant Polynomials 


Let X = {x ‘] be an r x r matrix with indeterminate entries x!, ;- Over any field F, 


a polynomial P(X) € Fix’ ‘| in the r* variables x‘, , is said to be invariant if for all 
invertible matrices A in the general linear group GL(r, F’), we have 


P(A~!XA) = P(X). 


Over the field of real or complex numbers, there are two sets of obviously invariant 
polynomials: (1) the coefficients f;(X) of the characteristic polynomial of —X: 


det(AI+X) = Lie pO) een 


and (2) the trace polynomials £,(X) = tr(X*). This appendix contains results on 
invariant polynomials needed in the sections on characteristic classes. We discuss 
first the distinction between polynomials and polynomial functions. Then we show 
that a polynomial identity with integer coefficients that holds over the reals holds 
over any commutative ring with 1. This is followed by the theorem that over the 
field of real or complex numbers, the ring of invariant polynomials is generated by 
the coefficients of the characteristic polynomial of —X. Finally, we prove Newton’s 
identity relating the elementary symmetric polynomials to the power sums. As a 
corollary, the ring of invariant polynomials over R or C can also be generated by the 
trace polynomials. 


B.1 Polynomials Versus Polynomial Functions 


Let R be a commutative ring with identity 1. A polynomial in n variables over R is an 
element of the R-algebra R[x1,...,Xn], where x1, ..., X, are indeterminates. A poly- 
nomial P(x) € R[x1,--.,Xn] defines a function P: R" > R by evaluation. Hence, if 
Fun(R",R) denotes the R-algebra of functions from R” to R, then there is a map 


€: R[x,...,Xn] > Fun(R”,R), 
PrP. 


By the definition of addition and multiplication of functions, the map € is clearly 
an R-algebra homomorphism. An element in the image of € is called a polynomial 
function over R. 


Example. Let F be the field Z/pZ for a prime p. The set F* = (Z/pZ)* of nonzero 
elements of F forms a group of order p— 1 under multiplication. By Lagrange’s 
theorem from group theory, x?~! = 1 for all x € F*. Therefore, allowing x = 0, we 
get x? =x for all x € F. Thus, although x? and x are distinct polynomials in F [x], 
they give rise to the same polynomial functions in Fun(F"”, F’). In this example, the 
map 

€: F[x] > Fun(F,F) 


is not injective. 
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Proposition B.1. /f F is an infinite field (of any characteristic), then a polynomial 
function P: F" — F is the zero function if and only if P is the zero polynomial; i.e., 
the map 


€: F[xy,...,X,] > Fun(F", F) 
is injective. 


Proof. If P is the zero polynomial, then of course P is the zero function. We prove 
the converse by induction on n. Consider first the case n = 1. Suppose P(x) is 
a polynomial of degree m in x such that P is the zero function. Since a nonzero 
polynomial of degree m can have at most m zeros, and P(x) vanishes on an infinite 
field, P(x) must be the zero polynomial. 

Next, we make the induction hypothesis that € is injective whenever the number 
of variables is <n — 1. Let 


m 
P( X15 205% Xn) = D Aig 
k=0 


where P;(x,,...,%,—1) are polynomials in n—1 variables. Suppose P is the zero 
function on F”. Fix (ay,...,dy—1) € F”~!. Then P(aj,...,@n—1,Xn) is a polynomial 
in x», and P(a,...,@n—1,%n) € Fun(F, F) is the zero function. By the one-variable 
case, P(a1,...,@n—1,Xn) is the zero polynomial. It follows that all its coefficients 
P,(a1,-.-,@x—1) are zero. Since (a1,...,@n—1) is an arbitrary point of F"~!, the poly- 
nomial function / is the zero function on F”~!. By the induction hypothesis, the 
P,’s are all zero polynomials. Hence, P(x1,...,%n—1,%X) = Leip Pe (*1,--- Xp xe is 
the zero polynomial. 


Thus, for an infinite field F, polynomial functions on F” may be identified with 
polynomials in F'[x1,...,x,]. 


B.2 Polynomial Identities 


Using Proposition B.1, we can derive the following principle by which a polynomial 
identity over one field can imply the same identity over any commutative ring with 1. 
Contrary to expectation, in this principle a special case implies a general case. 


Proposition B.2 (Principle of extension of algebraic identities). [f a polynomial 
P(x) € Z[x1,...,Xn] with integer coefficients is identically zero when evaluated on 
a field F of characteristic zero, then it is identically zero when evaluated on any 
commutative ring R with 1. 


Proof. Because F has characteristic zero, the canonical map 
Z—>F, kek-l, 


is an injection. The injection Z > F induces an injection Z[x1,...,%n] > F[x1,...,%n] 
of polynomial rings, so that P(x) € Z[x1,...,x%n] may be viewed as a polynomial 
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over F. By hypothesis, the polynomial function P on F” is identically zero. Since F 
is an infinite field, we conclude from Proposition B.1 that P(x) is the zero polynomial 
in Z[x1,...,Xn]- 

If Ris a commutative ring with 1, then there is a canonical map 


Z>R, kwhk-l, 


that induces a canonical map Z[x1,...,Xn] > R[x1,.-.-,Xn], which need not be an in- 
jection, but under which the zero polynomial P(x) in Z|x] goes to the zero polynomial 
in R[x]. This means that P(x) will be identically zero when evaluated on R. 


Example B.3 (A determinantal identity). Let a',, bi, 1 <i,j <,, be indeterminates, 
and let P(A, B) be the polynomial 


P(A,B) = det(AB) — det(A) det(B) € Z[a',,b'). 


It is well known that if A and B are real r x r matrices, then P(A, B) is zero. By the 


principle of extension of algebraic identities (Proposition B.2), for any commutative 
ring R with 1 and for all A,B € R'™’, 


P(A,B) = det(AB) — det(A) det(B) = 0. 


Thus, the truth of the identity det(AB) = det(A) det(B) over R implies its truth over 
any commutative ring R with 1. 


In particular, over any commutative ring R with 1, if a matrix A € R’" is invert- 
ible in R’™", then det(A~!) = (detA)~!. 


B.3 Invariant Polynomials on gI(7, F’) 


Let F be a field, and gl(r,F) = F’*" the F-vector space of r x r matrices with entries 
in F. A polynomial P(X) on gl(r,F) is an element of the commutative F-algebra 
F [x], i.e., it is a polynomial in the indeterminates xii< i,j <r. An invertible 
matrix A € GL(v, F) acts on polynomials on gl(r, F’) by 


(A-P)(X) = P(A~!XA). 
A polynomial P(X) on gl(r, F’) is said to be invariant if 
P(A-'XA) = P(X) for all A € GL(r,F). 


Example B.4 (Determinant). Let F be any field and R = F |x’) the polynomial algebra 
over F in r x r indeterminates. If A € GL(r, F), then 


A E pre CR? 


and X = [x4] is a matrix of indeterminates in R’~", so by Example B.3, 


det(A~!XA) = (detA) | detX detA = detX. 


Therefore, det(X) € F'|x;] is an invariant polynomial on gl(7, F). 
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Although an invariant polynomial on g[(r,F') applies a priori to matrices with 
entries in F, it can in fact be applied to any commutative F-algebra R with identity. 


Proposition B.5. Suppose P(X) € F |x’ ‘] is an invariant polynomial on g\(r,F), and R 
is a commutative F-algebra with identity 1. Then for any A € GL(r,F) and X € R™", 


P(A~!XA) = P(X). 


Proof. Fix an invertible matrix A € GL(r, F). Since P(X) is invariant on gl(r, F’), the 
polynomial 

P,(X) := P(A7!XA) — P(X) € F [x4] 
is by definition the zero polynomial. If R is a commutative F'-algebra with identity 


1, then the canonical map F > R, f +> f- 1, is injective and induces an injection 
F(x’ ‘| > R[x a, under which the zero polynomial P4(X) maps to the zero polynomial. 


Theréibie, P,(X) = P(A~!XA) — P(X) is the zero polynomial in R[x]. It follows 
that for any X € R’*’, 


P(A~!XA) = P(X). 


Let A, xi be indeterminates. Then 
det(AI+X)=Al4+ fi(X)AT 1+. fr-1(K)A + fi (X) 
— -> med Re ke 


The polynomials f,(X) € Z[x' ‘| have integer coefficients. For any field F’, the 
canonical map Z — F is not aeceeaunly injective, for example, if the field F has pos- 
itive characteristic. So the induced map ~: Z[x5] — F |x‘] is not necessarily injective. 
Although it is possible to define the palynomialé J, over any field, we will restrict 
ourselves to fields F of characteristic zero, for in this case the canonical maps Z > F 
and Z\x'] + F [x‘] are injective, so we may view the f;,(X)’s in Z[x' ‘| as polynomials 
over F’. 


Proposition B.6. Let F be a field of characteristic zero. Then the coefficients f;(X) 
of the characteristic polynomial det(AI +X) are invariant polynomials on g\(r, F). 


Proof. With A, x!, as indeterminates, we take R to be the commutative ring F [A, x4] 
with identity. Since det(X) is an invariant polynomial on gl(r, F) (Example B.4), by 
Proposition B.5 for any A € GL(r,F), 


det(AJ-+X) = det(A~!(AI+X)A) 
=det(AI+A!XA). 


Hence, 


Lie) x)ar* = dita AXA 
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Comparing the coefficients of A’—* on both sides gives 


fx(A~!XA) = fi(X) forall A € GL(r,F). 


This proves that all the polynomials f;,(X) are invariant polynomials on gl(r, F). 


For each integer k > 0, the kth trace polynomial is defined to be 
2; (X) = tr(X*) € Zi]. 
Since tr(X) = fi (X) is an invariant polynomial on gl(r,F), for any A € GL(7,F), 
D(A 1XA) = tr(A~!X*A) = tr(X*) = (X). 


So the trace polynomials &;(X) are also invariant polynomials on gl(r, F). 


B.4 Invariant Complex Polynomials 


The goal of this subsection is to determine all complex polynomials on gl(r,C) inv- 
ariant under conjugation by elements of GL(7,C). 

Suppose X € gl(r,C) is a diagonalizable matrix of complex numbers (as opposed 
to indeterminates). Because X is diagonalizable, there exists a nonsingular matrix 
A € GL(r,C) such that 


al 
ATXA=| -.. | =diag(t,...,t,), 
t, 
with f),...,t; being the eigenvalues of X. For such a matrix X, 
tr(X) = (A7!XA) = Yt, 


mcr A'XA) =u. 


Thus, tr(X) and det(X) can be expressed as symmetric polynomials of their eigen- 
values. This example suggests that to an invariant polynomial P(X) on gl(7,C), one 
can associate a symmetric polynomial P(t;,...,t-) such that if t),...,t, are the eigen- 
values of a complex matrix X € gl(r,C), then P(X) = P(t,...,t,). Note that D4 
and J]?; are simply the restriction of tr(X) and det(X), respectively, to the diagonal 
matrix diag(t1,...,t,). 

Let Inv(gl(r,C)) be the algebra of all invariant complex polynomials on gl(7,C), 
and C[t;,...,t,]5" the algebra of complex symmetric polynomials in t),...,t,. To an 
invariant polynomial P(X) € Inv(gl(r,C)), we associate the polynomial 


ty 
Pts. iste) =P 
t, 
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If o € S, is a permutation of {1,...,r}, the permutation matrix P, © GL(r,Z) is 
the matrix whose (i, o(i))-entry is 1 and whose other entries are all 0. For example, 


0 1 
1 0 
Oa 
1 

If o is a transposition, then Pg is called a transposition matrix. 

It is easy to check that conjugating the diagonal matrix diag(t,...,t,) by the 
transposition matrix Pj;;) interchanges f; and t; in diag(t),...,t,). Consequently, 
by conjugating diag(t,,...,t,) by a product of transposition matrices, we can make 


ti,...,¢, appear in any order. Thus, P(t, ...,¢,) is invariant under all permutations of 
ti,...,t;, 1.e., itis a symmetric polynomial in t,...,f;. 


Theorem B.7. The map 


: Inv(gl(r,C)) > CIn,...,t,]% 


P(X) P(t),...,t-) =P 


given by restricting an invariant polynomial P(X) to the diagonal matrix 
diag(t),...,¢,) is an isomorphism of algebras over C. 


The elementary symmetric polynomials in a set of variables t),...,¢, are the 
polynomials 


op 1 =P ae Oe Se 


i<j 


Ox = py fiylig Tig, 
1 <i <ig<+++<ig<r 


O, = t+ fp. 


Example B.8. For any indeterminate A, let 


P(X) = det(AI+X) = Ysile) a 
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Then 


0(P,(X)) =Py —_ = det | Al+ 
t, t, 


=[][(+n) 
i=l 
=F oor. (B.1) 
k=0 
Since @ is an algebra homomorphism, 


9(PA(X)) = 0 (DL Flx)A*) = Y olfa(Xy)a™. (B.2) 
k k 


Comparing the coefficients of A'-* in (B.1) and (B.2) gives 
P(Se(X)) = On(t). 


The proof of Theorem B.7 depends on two facts from algebra: 


(1) The set of all diagonalizable r x r complex matrices is a dense subset of gl(r,C). 
(2) (Fundamental theorem of symmetric polynomials) Every symmetric polyno- 
mial in t,...,t, 1s a polynomial in the elementary symmetric polynomials 0}, 

sy Or: 


Proof (of (1)). Recall from linear algebra that matrices with distinct eigenvalues are 
diagonalizable. The eigenvalues of the matrix X are the roots of its characteristic 
polynomial f(A) = det(AJ —X). The polynomial fy (A) has multiple roots if and 
only if fx(A) and its derivative f{(A) have a root in common. Two polynomials 
f(A) and g(A) have a root in common if and only if their resultant R(f,g), which 
is a polynomial in the coefficients of f and g, is zero. Thus, the matrix X has re- 
peated eigenvalues if and only if R( fx, fy) = 0. Note that the resultant R( fy, fy) isa 
polynomial with coefficients that are functions of entries of X. Since R( fx, fy) =0 
defines a subset of codimension one in gl(n,C), the set of complex matrices with 
distinct eigenvalues is dense in gl(r,C). Hence, the set of diagonalize matrices in 
gl(n,C) is dense. 


Proof (of (2)). This is a standard theorem in algebra. See, for example, [13, 
Chap. IV, §6, Th. 6.1, p. 191]. 


Proof (of Theorem B.7). Injectivity of @: By Proposition B.1, because C is an infinite 
field, complex polynomials may be identified with complex polynomial functions. 
So we will interpret elements of both Inv(gl(7,C)) and C[n,...,1,)5" as polynomial 
functions on gl(r,C) and C’, respectively. 

Suppose the invariant polynomial P(X) vanishes on all diagonal matrices X € 
gl(r,C). By the invariance of P(X) under conjugation, P(X) vanishes on all diago- 
nalizable matrices X € gl(r,C). Since the subset of diagonalizable matrices is dense 
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in gl(r,C), by continuity P(X) vanishes on gl(r,C). Therefore, P(X) is the zero 
function and hence the zero polynomial on gl(7,C). 


Surjectivity of p: We know from Example B.8 that the elementary symmetric polyno- 
mial 0;(t,,...,¢,) is the image of the polynomial f;,(X) under @. By the fundamental 
theorem on symmetric polynomials, every element of C[t,...,t,|5" is of the form 
Q(0|,...,0;,) for some polynomial Q. Then 


Q(01,.--, Or) = Q(P(Fi(X)),--- PF(X))) 
= P(Q(fi(X),---,f(X)) = P(P(X)), 


where P(X) = QO(fi(X),...,f-(X)) € Inv(gl(7,C)). 


Example B.9. If £;,(X) = tr(X*) are the trace polynomials, then 


k 
ty 


Mel tects te Tes =P in 
k 
t 


Fr 


So the symmetric polynomials corresponding to the trace polynomials 2, are the 
power sums Sx. 


Theorem B.10. The ring of invariant complex polynomials on gl\(r,C) is generated 
as an algebra over C by the coefficients f;(X) of the characteristic polynomial 
det(AI+X). Thus, 


Inv(gl(7,C)) = C[fi(X),---f-(X)]- 
Proof. Under the isomorphism 
@: Inv(gl(r,C)) = CIn,...,t,)%, 


the polynomials f,(X) correspond to the elementary symmetric polynomials 
Ox(t1,-.-;t-). Since C[h,...,t-]°* = C[o1,...,0,] by the fundamental theorem on 
symmetric polynomials, the isomorphism @ gives 


Inv(gl(r,C)) = C[fi(X),.-- fr(X)]- 


B.5 L-Polynomials, Todd Polynomials, and the Chern Character 


The L-polynomials, Todd polynomials, and the Chern character are three families of 
polynomials of great importance in algebraic topology. All three are defined in terms 
of elementary symmetric polynomials. 

Consider the formal power series 


Vi 1 4 
j= — 1 | t t see 
fy) tanh \/t 3 45 se 
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Let t,,...,t, be algebraically independent variables over Q. Then the formal power 
series 


i i 
F(th,...yt ) = TL) = TT] (14+ 3 mee) 


i=1 


=14 A(t) +R 6, +BG, at) +> 


is symmetric in f,...,t,, so that for each n its homogeneous component F;,(t1,... ,t,) 
is a symmetric polynomial of degree n. By the fundamental theorem of symmetric 
polynomials, 

Fi(ti,---,tr) =Ln(x1,---,*r), 


where x),...,x; are the elementary symmetric polynomials in t),...,t-. The poly- 
nomials L,(x1,...,x,-) are called the L-polynomials.. The L-polynomials appear in 
Hirzebruch’s signature formula (26.4). 

It is easy to work out the first few L-polynomials: 


Fi Lizes “5D ax = L1(~1), 
Fy(ty,...,ty) yi; +e 5 Day 
9G 
1 
=~ gg (Can? —2 ditt) +5 . tit 
i<j 9G 
= Pay nat 
Bae ae 


1 
= 45 — (Tx) — xt) = Lo(x1,x2). 
Instead of f(t) = \/t/tanh V7, one may apply the same construction to any an- 
alytic function to obtain a sequence of homogeneous polynomials. For example, 
starting with the formal power series of 


=14+F7(4)+Fl(t,n)+-- 
=> 1+7) (41) + 7o(x1,%2)+--: ‘ 


The polynomials T,,(x1,...,Xn,), called the Todd Polynomials, play a key role in the 
Hirzebruch—Riemann-Roch theorem (26.1). 
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Again, it is easy to work out the first few Todd polynomials: 
1 


1 
Fi(t,,...,tr) = = 5" = T,(x1) 


1 1 
Fy(ty,...,tr) = Pee 
i<j 


“Mga 2m3 $0) 


i<j i<j 


1 
= pi +x) = To(x1,X2). 


The Chern character is defined differently from the L-polynomials and the Todd 
polynomials. Instead of the product of f(t;), we take the sum: 


n 
ChGy, 2.5%.) = Se 
i=l 
Since in each degree }?_, e” is a symmetric polynomial in f,...,f,, it can be exp- 
ressed as a polynomial in the elementary symmetric functions x),...,x,. Thus, 
ch(x},.--,%n) =n+chy(x1,...,%n) +eho(x1,.--,%) +..., 
where 
ch; (x1,---52) = Ire 
Then 


eh Gisaeada) = Y= 31, 


cho (x1,...,%,) = > i = (24)? eit 
_ a — x2 
ae: 
The Chern character of a vector bundle E is defined to be 
ch(E) =ch(c1,...,¢n), 


where cj,...,Cn are the Chern classes of E. Formally, if c(E) =[[(1 +4), then 
ch(E) = dew. 


B.6 Invariant Real Polynomials 


In this subsection we will prove the analogue of Theorem B.10 for invariant real 
polynomials, that every invariant real polynomial on gl(r,R) is a real polynomial in 
the elementary symmetric polynomials. 


Proposition B.11. If a real homogeneous polynomial P(X) on gl(r,R) is invariant 
under conjugation by GL(r,R), then it is invariant under conjugation by GL(r,C). 
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Proof. By Cramer’s rule, A~! = A* / detA, where A* is the transpose of the matrix of 
signed minors of A. Suppose P(X) has degree k. Then the equation 


P(AXA~!) = P(AXA* /detA) = P(X) 
is equivalent by homogeneity to 
P(AXA*) = (detA)‘P(X), 


7 q(A,X) := P(AXA*) — (detA)*P(X) =0. (B.3) 


This is a polynomial equation in aj; and x;; that vanishes for (A,X) € GL(7,R) x 
R’™”. By continuity, (B.3) holds for all (A,X) € R’™” x R"*”. Since g(A,X) gives 
a holomorphic function on C”*”" x C’*", if it vanishes identically over R, then it 
vanishes identically over C. (This is a property of holomorphic functions. It can 
be proven by noting that a holomorphic function is expressible as a power series, 
and if the function vanishes identically over R, then the power series over R is zero. 
Since the power series over C is the same as the power series over R, the function is 
identically zero.) 


Proposition B.11 may be paraphrased by the equation 
Inv (gl(7,R)) = Inv(gl(r,C))N R[x]. 
Theorem B.12. Let Qp be the restriction of the map ~ of Theorem B.7 to Inv (gl(r,R)): 


Qe: Inv(gl(r,R)) > Rih,... a 


P(X) +4 P(ti,...sth) =P 


Then Qk is an algebra isomorphism over R. 


Proof. We have a commutative diagram 


Inv(gl(r,C)) > C{i,...,¢-)% 
U U 
Inv (gl(r,R)) “> Rin, ..-,¢,) 


The injectivity of @p follows immediately from the injectivity of @. 

As for the surjectivity of @p, the proof is the same as the complex case (Theo- 
rem B.7). More precisely, every symmetric polynomial p(t),...,t-) € R{ti,...,t;]*" 
is a real polynomial in the elementary symmetric polynomials 0),...,0, of t),...,t: 


D(t1,---5tr) = q(O1,...,Gr). 
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Since o;(t) = @(f;(X)) and ¢ is an algebra homomorphism, 


P(ty +++ str) = 4 (P(Fi(X)), +++ PFr(X))) = 0 (Q( A(X), A(X), 


which proves that @p is onto. 


Theorem B.13. The ring of invariant real polynomials on g\(r,R) is generated as an 
algebra over R by the coefficients f,(X ) of the characteristic polynomial det(AI +X). 
Thus, 


Inv(gl(r,R)) = RUfi(X),..-, f-(X)]- 


Proof. The proof is the same as that of Theorem B.10, with R instead of C. 


B.7 Newton’s Identities 


Among the symmetric polynomials in f,...,t,, two sets are of special significance: 
the elementary symmetric polynomials 0),...,0,- and the power sums s},...,5,. 
Newton’s identities give relations among the two sets. As a corollary, each set gen- 
erates the algebra of symmetric polynomials over R. Define oo = 1. 


Theorem B.14 (Newton’s identities). For each integer k > 1, 


(Eve) + (-1)*ko;, = 0, 


i=0 
or written out, 
Sk — O1Sp—1 + 028-2 — ++» + (—1)*l og_151 + (—1)*keoj, = 0. 
Proof (from (2, p. 79]). Consider the identity 
(l—tx)---(1—t,.x) =1—oyx+ Cox ase (-1)'o,x". 


Denote the right-hand side by f(x) and take the logarithmic derivative of both sides 
to get 


In(1 — 1x) ae +In(1 =14) _ In f(x), 


“4 ote FG) 
1—tx 1—t,.x f(x)" 
Hence, 
tx te. - ae) 


l—-nx' | 1-tx f(x) 


318 $B Invariant Polynomials 


Expanding the left-hand side as a sum of geometric series, we get 


Now multiply both sides by f(x): 


fs) ( Sse") = —xf" (x), 


(1 — oyx+ 9x7 — +» + (-1)'0,x")(s1x-+ 99x +--+) 


= 01x —209x7 +---4 ( Ly “vox. 


A comparison of the coefficients of x* on both sides of the equation gives 


Sk — O1Sp_y $+ +( 1) 64151 = (-1)** ko. 


Using Newton’s identities we can write each elementary symmetric polynomial 
Ox as a polynomial in the power sums. 


Example B.15. For 0; and 02 we can get the relations directly: 
O| eee 
2 => tit) = s(n +--+) -(F +--+) 


ty 


1 
= i 


For 03, Newton’s identities give 
303 = $3 — O1S2 + 0251 
1,9 
= $3 — S182 + 5 (s} — $2) Sy. 


A mathematical induction on k using Newton’s identities proves that every elemen- 
tary symmetric polynomial o; is a polynomial in the power sums s}1,..., 5x. 


Theorem B.16. The algebra of symmetric polynomials over R can be generated as 
an algebra over R by the elementary symmetric polynomials or by the power sums: 


Rity,...,t-]5* =R[oy,...,0-] =RIs1,..., sr]. 


Proof. We have just shown that every ox is a polynomial in s;,...,5,. Conversely, 
since each s; is symmetric, by the fundamental theorem on symmetric polynomials 
it is a polynomial in 0),...,0;. 
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Theorem B.17. The algebra of invariant polynomials on gl(r,R) is generated as an 
algebra over R by the coefficients f\(X),..., f-(X) of the characteristic polynomial 
det(AI +X) or by the trace polynomials tr(X),...,tr(X"). 


Proof. By Examples B.8 and B.9, under the algebra isomorphism 
@: Inv(gl(r,R)) > Rin,... Pele 


the polynomials f;(X),...,f-(X) correspond to the generators 0),...,0, and the 
trace polynomials correspond to the generators s;,...,5,. This theorem then fol- 
lows from Theorems B.13 and B.16. 


An analogous statement holds for the complex invariant polynomials on gl(r,C). 


Problems 


B.1. The invariant polynomial f,(X) 
Let X = [x] be an r x r matrix of indeterminates. The polynomials f,(X) € Z[x'] are the 
coefficients in the characteristic polynomial 
2 
det(AI+X) = > fe(x)ar™. 
k=0 


Find a formula for the polynomial f(X) € Zxi). Write out all the terms of the formula when 
r=3. 


Hints and Solutions to Selected End-of-Section 
Problems 


Problems with complete solutions are starred (*). Equations are numbered consecutively 
within each problem. 


1.1 Positive-definite symmetric matrix 

Fixing a basis e1,...,€n for V defines an isomorphism V > R” by mapping a vector in V to 
its coordinates relative to e1,...,é@,. So we may work exclusively with column vectors in R”. 
Because A is positive-definite, (x,x) = x’ Ax > 0 with equality if and only if x = 0. Because 
A is symmetric, 


(y,x) =y" Ax = (x"A’y)” = (x" Ay)? =x! Ay = (x,y). 


Finally, the expression x’ Ay is clearly linear in x and in y. 


1.2* Inner product 
By linearity in the first argument, 


(u,w) = (v,w) iff (u—v,w) =0. 


Since the inner product is positive-definite, this implies that u— v= 0. 


1.4* Positive linear combination of inner products 
For any v € V, we have (v,v) = Yaj(v,v); > 0. If equality holds, then every term is 0 and 
therefore v=0 by the positive-definiteness of any of the inner products ( , );. Symmetry and 
bilinearity in either argument are just as obvious. 


1.5* Extending a vector to a vector field 

Let (U,x!,...,x") be a coordinate chart about p. If Z) = Ya'd/dx'|,, then Z:= Ya'd/dx' is 
aC™ vector field on U that extends Zp. To extend Zp to a C™ vector field on M, let f be aC” 
bump function supported in U which is 1 at p. Define 


X= f(QZq, for q € U, 
0, forg ¢U. 


Then X is a vector field on M such that Xp = Zp. 
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On U the vector field X is C* because it is the product of a C® function and a C® vector 
field Z. If g ¢ U, then since supp f is a closed subset C U, there is a neighborhood V of g such 
that VM supp f = @. Thus, X = 0 on V, and is trivially C™ at g. This proves that X is C™ at 
every point of M. 


1.6* Equality of vector fields 

(=) If X =Y, then clearly (X,Z) = (Y,Z) for all Z € X(M). 

(<) To prove that X = Y, it suffices to show that X, = Y, for all p € M. Let Zp be any vector 
in T,M. By Problem 1.5, Zp can be extended to a C™ vector field Z on M. By hypothesis, 
(X,Z) = (Y,Z). Hence, 


(Xp,Zp) = (X,Z) yp = (Y,Z) p = (Yp,Zp). 


By Problem 1.2, this implies that X, = Y,. Since p is an arbitrary point of M, it follows that 
X=Y. 


1.7* Hyperbolic upper half-plane 
Since 1/ y? is always positive on H?, by Problem 1.4 the function (, ) is an inner product at 
each point p € H?. It remains to check that if X,Y € X(H7), then (X,Y)! = (1/y?)(X,Y) isa 
C® function on Hi. This is true because being the Euclidean metric, (X,Y) is C” on H? and 
1/y* is C? on H?. 


2.1* Signed curvature 
If ¥'(s) =T(s) = (cos 0(s), sin @(s)), then 


7o1=r'6)= | oineoacs] mm= [Ta ots] = [enol 


Therefore, Y’(s) = 6’(s)n(s). This proves that k = ’(s). 


2.3 Write @ = arctan(y/x). Compute k = d@/ds = (d0/dt)/(ds/dt), where ds/dt = (x7 + 
2) 1/2 
eae 


2.4 In Problem 2.3, set the parameter t = x. 
2.5 « =ab/(a’ sin?t +b? cos”t). 
2.6 (1/27)[-8+(4+9a?)?/?]. 


2.7* Curvature of a space curve 


a) Since 
t t 
=f lwlau= [ Va +Pdu=V2+b1, 
0 0 


t=s/Va2 +b. 
b) Let C= 1/Va2+b?. Then y(s) = (acosCs,asinCs,bCs). A short computation gives 
ly"(s)| =a/(@ +0”). 


3.1 K1,k) =H+2VH2-K. 


5.3* The Gauss map 
Let c(t) be a curve through p with c’(0) = Xp. Define a vector field M(t) along c by N(t) = 
ANe(r)/at. Then 
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d d d| « 
Vx, (X, j= —_— v(c(t)) — ieee Sy ote N(t) 
p\4p dt = dt ro c(t) dt os 
=N'(0)=D.()N _ (by Proposition 4.11) 
= Dy,N = —L(Xp). 


5.4 Area v(M) = Jy) 1 = Jiy|detvs| by the change of variables formula. Now apply Prob- 
lem 5.3. 


5.8 Choose an orthonormal basis e1 ,e2 for T)M and write Xp = (cos @)e; +(sin@)e2. Let the 


shape operator L be represented by the matrix b M relative to e},¢2. Find k(Xp) in terms of 


be 
a,b,c,d and @ and compute iG K(X,)d@. 


5.9 (a) Take the inner product of both sides of the equation L(e,) = ae; +be2 with e; and 
with e2. Then solve for a,b. 


-1 
(b) K =det z | det bi / = (eg — f*)/(EG—F?). 
H = (eG—2fF +gE)/(2(EG—F?)). 
5.10 (a) First show that e; = (1,0,hx), e2 = (0,1, hy). 
(b) By Lemma 5.2, (L(e;),e;) = (De,ej;,N), where N = (e] x e2)/||e1 x e2||. Show that 
De, 1 = (0,0, Axx), Dere1 = (0,0, hxy), Deseo = (0,0, hyy). 
(c) Use Problem 5.9(b). 
5.11 (a) e; = ((sinhu) cos @, (sinhu) sin@, 1), ex = (— (coshu) sin 8, (coshu) cos 8,0). 


(b) E =cosh?u, F =0, G = cosh? u. 
(c) e, = ((coshu) cos @, (coshu) sin @,0), e5 = (— (sinhu) sin @, (sinhu) cos @, 1). 


7.1* Derivation and local operator 

Suppose a C® function g on M is zero on an open set U. Let p € U and choose a bump function 
f © C*(M) such that f(p) = 1 and supp f Cc U. then fg = 0 because g =0 on U and f =0 
on M — U. By the derivation property, 


0 = (D(f8))(p) = (PF) (p)a(p) + f(p)(Ds)(P) = Da(p), 


since g(p) = 0 and f(p) = 1. As p is an arbitrary point of U, Dg =0 on U. 


7.4* Section with a prescribed value 


Let U be a neighborhood of p in M over which E is trivial: Ely £ Ux R*. Suppose 
(e) = (p,v). Then sy (q) := 6! (q,v) is aC® section of E over U with sy(p) = @~!(p,v) =e. 
By Proposition 7.13, there is a C® global section s € I(M,£) that agrees with sy over some 
neighborhood of p. Then s(p) = sy(p) =e. 


7.5* Coefficients relative to a global frame 
It suffices to show that a/ is C~ over any trivializing open set for E. Choose a trivializing open 
set U for E with trivialization 


o:0°'(U)~UxR*, ers (p,b!,...,b*). 
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Let s1,...,5, be the frame over U corresponding to the standard basis in R* under ¢. Then for 
any ve m~!(U), v= Yb's;. The b'’s are C® functions on E because they are components of 
the C* map @. 

Since e;,...,e, 1s aC™ frame over U, 


ej = > fisi 
with f; C*. Then 


v ¥ b's; Yale; ¥ a’ fisi. 


Comparing the coefficients of s; gives 


bi =Dai fi 
J 


In matrix notation, b = Ca. Hence, a=C —Ip, which shows that the a/ are C® because the 
entries of C and b are all C™. 


12.1* The orthogonal group O(2) 


(a) The condition A7A = / is equivalent to the fact that the columns of the matrix A form an 


4 be the first column of A. Then a2 +b? = 1. Since the 


orthonormal basis of R2. Let iF 


second column of A is a unit vector in R? orthogonal to A , it is either EB or |?) ‘ 


Hence, 


with a* +b? = 1. 
(b) In part (a), 


Therefore, 


oo-{f 


The condition a? + b? = 1 implies that (a,b) is a point on the unit circle. Hence, (a,b) is 
of the form (cost, sint) for some t € R. So every element of SO(2) is of the form 


cost —sint 
sint cost] ” 


e+b ath, 


14.1* Geodesic equations 
Differentiating (14.10) with respect to t gives 


/ ky 
2y = ty 1 — ky? = y—__=__ 
V1l- k2y2 


. k2 33 “2, 
= »( *\s = PP (by (14.8)) 


yfl—ky2 


(by (14.10) 
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Thus, 


which is (14.5). 


16.4* Volume form of a sphere in Cartesian coordinates 

The unit outward normal vector field on the sphere S’~!(a) is X = (1/a)¥.x'0/dx'. The 
volume form on the closed ball B*(a) of radius a is the restriction of the volume form on R”; 
thus, 


Vol g(a) = dx! A+++ dx". 
By Theorem 16.11, 
Vol sn-1 (a) = ly (dx! [eee A dx") 


1 : 
= 7 la/ax! (dx! Aves A dx") 


1 a 7 
— =F (=1) ax! A+++A dx! A+++ A dx". 
a 


20.1* Topology of a union 
Using the bijection ¢q@ : Sq — Yo, in (iii), each Sq can be given a topology so that dq becomes a 
homeomorphism. Condition (iv) guarantees that the open subsets of Sy S p are well defined. 
Let J be the topology on S generated by all the open subsets of Sg, for all a; this means TJ is 
the smallest topology containing all such sets. 

If Ug is open in Sg, and Ug is open in Sg, then Ug Ug is open in J. Since Ug NUg C So, 
we need to show that Ug Ug is already open in Sq so that in generating J we did not create 
additional open sets in Sg. By condition (ii), Sa NS B ‘= Sqp is in the collection C, so it is 
open via bgp: Sg — Yop. By the compatibility of Sap and Sq in (ii), Sgg is open Sq. Since 
Ug is open in Sq, Va Sag = Ua Sg is open in Sq. By (iv), Ua Sg is also open in Sz. 
Therefore, (Ug Sp) N Ug =UgN Ug is open in Sp. By (iv) again, it is also open in Sq. 


21.9 * The Maurer—Cartan equation 

Since d@ + 5(8, 6] is a 2-form and therefore is bilinear over the C™ functions, it suffices to 
check the equation on a frame of vector fields. Such a frame is, for example, given by left- 
invariant vector fields. Thus, it suffices to check 


d0(X,Y) + ~[0,6](X,Y) =0 


1 

2 

for two left-invariant vector fields X,Y on G. 
By Problem 21.8, 


d0(X,Y) = X0(Y) —Y@(X) — 0([X,Y]). 


Since 6(X) and 6(Y) are constants, X0(Y) = 0 and YO(X) = 0. Thus, 


d0(X,Y) = —6([X,Y]) = —6.([X,¥]e) = —[X,Y]e — —[Xe,%]. 
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By (21.10), 
(6, 0](X,¥) = [@(X), 0(Y)] — [A(Y), 0(X)] 
= 2[0(X), 0(Y)] 
= 2[0.(Xe),@-(Ye)| since @(X),@(Y) are constants) 
= 2[Xe, Ye] 


The Maurer—Cartan equation follows. 


22.4 If T, is an (a,b)-tensor field and 7) is an (a’,b’)-tensor field, evaluate both sides on 
@1,---;@ara'sVi,--->Yory- 
27.2* Short exact sequence of vector spaces 


By the first isomorphism theorem of linear algebra, 


B 
= ~imj=C. 
i(A) — kerj oe 


Hence, 
dim B — dimi(A) = dimC. 


Because i: A > B is injective, dimi(A) = dimA. Therefore, 


dimB = dimA+dimC. 


27.4* The differential of an action 
By linearity, 
Hx (Xp, lgxA) = Hx (Xp,0) + Hs (0, lgxA), 
so it suffices to compute the two terms on the right separately. 
Let c(t) be a curve through p in P with initial vector X,. Then a curve through (p,g) in 
P x G with initial vector (Xp,0) is (c(t),g). Computing the differential 1, using the curve 
(c(t),g), we get 


Hs. (p,g) (Xp, N)=— 


A curve through (p, g) with initial vector (0, 0g.A) is (p, ge’), so 


d 
Hs. (p,g) (0, £¢xA) = dt u(p, ge") 
*=0 
d tA 
=—|  pse 
dt |,_0 
= Ang. 


Hence, 


Ls. (pg) (Xp. lgxA) = gxXp +Apg. 
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30.3* Lie bracket of horizontal vector fields 


Q(X, Y) = (do+ 3(0-0)) %Y) 


= (do)(X,¥) + 5 ((eo(X),@(¥)] — [o(Y), @(X)]) 
=(d@)(X,Y) (since X, Y are horizontal) 
=Xa(Y)—Yo(X)— @([X,Y]) 
=—o([X,Y]). 

Thus, [X,Y] is horizontal if and only if Q(X,Y) =0. 


31.4* Tensorial forms 
Let pe P, g €G, and w,..., uz € TyP. Then 


rs (Wag) (U1, ---5UK) = Wh A reetlt<< 257 ealle) 
Sig (Wal Tatplli gave Ter get) 
= (8 |) fp | (We(tattr,-.., TeUte)) (because fpg = fp ° p(g)) 
=p(g')wi(ur,...,ue)- 


31.5* Tensorial forms 
Let p € Pand uy,...,uz € TP. Write x = 1(p). Then 


(978) p(ui,-- ste) = fp (Pe (attr. +5 Matte) 
=fp FePottiye sie Uk) 


= Pp (uy ,.-- Uk). 


32.1* Polynomials on a vector space os 
Let a!,..., a” be dual to e;,...,e, and B!,..., B” be dual to wy,..., um. Then of! = dc5B! for 


some invertible matrix [ci], and 


i Yaya oe lk 
= “lk Bil... Bik 
= Doane lp 
So f is also a polynomial of degree k with respect to u1,...,Un- 


32.2" Chern—Weil forms 
Suppose ot’ = S.c',B/. Then 
Qi = a! (Q) = > ci B/(Q) = Yi 
and 
f= Lac} p siclk Bik 
= arc} ioe BN Bh, 
So 
f(¥%)= Lac ae pi 
— Yi aQh aia Qii 
= f(Q). 
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R" Euclidean n-space (p. 2) 
(u!,u?,u>) a point in R? (p. 2) 
a 
u= uw column vector (p. 2) 
(,) 7 dot product, inner product, or Riemannian metric (p. 2) 
\|vI| length of a vector v (p. 2) 
(, )lwxw restriction of ( ,) to a subspace W(p. 3) 
Ae transpose of the matrix A (p. 4) 
T,M tangent space at p to M (p. 4) 
HH? upper half-plane (p. 7) 
[a, b} closed interval from a to b (p. 9) 
T(s) unit tangent vector at time s (p. 11) 
T'(s) derivative of T(s) (p. 11) 
n(s) unit normal vector to a curve at time s (p. 11) 
K signed curvature of a plane curve (p. 12) 
x dx/dt (p. 14) 
a d?x/dt? (p. 14) 
Np normal vector at p (p. 17) 
K(Xp) normal curvature (p. 18) 
H mean curvature (p. 18) 
K Gaussian curvature (p. 18) 
x(M) Euler characteristic (p. 20) 
Det directional derivative of f in the direction Xp (p. 22) 
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partial derivative 0 /dx! (p. 22) 

Lie algebra of all C™ vector fields on a manifold M (p. 23) 
Lie bracket of two vector fields (p. 24) 

torsion tensor (p. 24) 

ring of endomorphisms over R of a vector space V (p. 24) 
curvature tensor (p. 24) 

vector field along a curve (p. 25) 

velocity vector field of a curve (p. 25) 

the vector field 0; evaluated at c(t) (p. 26) 


the set of C® vector fields along a submanifold M in a manifold 
R (p. 26) 


zero set of a function f (p. 29) 

shape operator, Weingarten map (p. 30) 
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gradient of a function f (p. 34) 

Gauss map (p. 39) 

affine connection, connection on a vector bundle (p. 43) 
tangent bundle of M (p.51) 

the set of smooth sections of E over U (p. 51) 
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(p. 62) 
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dual 1-form (p. 84) 

torsion form (p. 84) 

Poincaré disk (p. 86) 


sectional curvature of a plane P (p. 92) 
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dx ® dx tensor product (p. 93) 

ds? Riemannian metric (p. 93) 

U T.()M disjoint union (p. 95) 

D/dt covariant derivative (p. 96) 

IV pushforward of a vector field V(t) along a curve (p. 98) 

ly Christoffel symbols (p. 100) 

Exp, exponential map of a connection (p. 115) 

B(0,€) open ball of radius € centered at 0 (p. 115) 

Sep differential of f at p (p. 116) 

g=T1.G Lie algebra of a Lie group G (p. 119) 

Lo left multiplication by g (p. 119) 

0;:(p) integral curve starting at p (p. 119) 

cx(t) = @(e) integral curve through e of the left-invariant vector field X 
(p. 120) 

exp exponential map for a Lie group G 120) 

Cg conjugation by g (p. 123) 

Ad adjoint representation of a Lie group (p. 123) 

ad adjoint representation of a Lie algebra (p. 124) 
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voly volume form of a Riemannian manifold M (p. 133) 

det determinant (p. 134) 
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¢ angle function along a curve (p. 139) 
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Ej jump angle (p. 142) 

B; interior angle (p. 142) 
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Free(V) free module with basis V (p. 152) 

V @rw tensor product of two R-modules (p. 152) 

Homa(V,W) the set of R-module homomorphisms from V to W (p. 157) 

VY.ve dual of an R-module (p. 157, p. 187) 


TAY) kth tensor power of a module V (p. 162) 
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algebra, 295 Bianchi identity 
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angle bilinear maps 
between two vectors, 2 universal mapping property, 153 
angle function, 139 bilinear maps over F, 59 
anticommutativity, 298 binormal, 15 
antiderivation, 299 bundle isomorphism, 51 
arc length, 2, 10, 128 bundle map, 51 
is independent of parametrization, 128 over a manifold, 51 
arc length function, 10 
arc length parametrization, 10 Cartan, Elie, 71 
ascending multi-index, 298 catenary, 36 
associated bundle, 275 catenoid, 37 
associativity characteristic classes, 212, 220 
of a bi-invariant metric, 125 independence of a connection, 218 
of the tensor product, 161 naturality of, 221 
atlas, 293 of a principal bundle, 290 
vanishing, 223 
base space, 242 characteristic form, 212 
of a vector bundle, 49 closed, 215 
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chart, 293 
chart about a point, 293 
charts 
compatible, 293 
Chern character, 315 
Chern classes 
of a complex vector bundle, 235 
of a principal GL(r, C)-bundle, 291 
Chern—Weil homomorphism, 212, 214, 290 
Christoffel symbols, 99, 100 
for the Poincaré half-plane, 101 
of a surface of revolution, 102 
of the Poincaré disk, 102, 114 
symmetric iff torsion-free, 100 
circle 
volume form, 134 
cobordant, 237 
cocycle condition, 245 
Codazzi—Mainardi equation, 62 
coefficients 
of the first fundamental form, 35 
of the second fundamental form, 36 
coefficients of characteristic polynomial, 
306 
coefficients of characteristic polynomial 
are invariant polynomials, 309 
coefficients of the characteristic polynomial, 
212,213 
compatible charts, 293 
compatible with the Hermitian metric, 234 
complex inner product, 234 
complex invariant polynomials 
generated by coefficients of characteristic 
polynomials, 313 
complex manifold, 236 
complex vector bundle, 228 
component, 31 
congruent matrices, 230 
connected component, 304 
connection 
affine, 43 
at a point, 77 
compatible with the metric, 45, 75 
defined using connection matrices, 209 
Euclidean, 43 
Levi-Civita, 45 
metric 
in terms of forms, 82 
on a complex vector bundle, 234 


on a framed open set, 81 
on a principal bundle, 254, 256 
on a trivial bundle, 72 
on a vector bundle, 72 
Riemannian, 45 
symmetric, 100 
connection forms, 80 
connection matrix, 80 
dependence on frames, 202 
connection on a vector bundle 
existence, 73 
connection-preserving diffeomorphism, 99 
preserves geodesics, 108 
connections 
convex linear combination of, 48 
convex linear combination, 73 
coordinate chart, 293 
coordinate vectors, 295 
covariant derivative 
corresponding to a connection, 262 
covariant derivative 
of tensor fields, 206 
of a basic form, 288 
of a tensorial form, 282 
of a vector field along a curve, 97 
on a principal bundle, 281 
on surface in R3, 104 
covariant differentiation 
along a curve, 95 
covector, 297 
curvature, 44 
G-equivariance, 271 
and shape operator, 32 
Gaussian, 18 
Gaussian, in terms of an arbitrary basis, 
64 
geodesic curvature, 138 
is F-linear, 44 
is horizontal, 271 
is independent of orientation, 13 
mean, 18 
normal, 18 
of Maurer—Cartan connection, 274 
of a connection on a principal bundle, 270 
of a connection on a vector bundle, 74 
of a plane curve, 12, 14 
of a space curve, 15 
of an ellipse, 14 
of directional derivative, 24 


principal, 18 
sectional, 92 
signed geodesic curvature, 140 
symmetries, 205 
total curvature, 146 
total geodesic curvature, 141 
curvature tensor, 207 
independence of orthonormal basis, 92 
curvature forms, 80 
curvature matrix, 80 
dependence on frames, 202 
is skew-symmetric relative to an 
orthonormal frame, 84 
curvature tensor, 74 
skew-symmetry, 91 
curve, 9 
geometric, 9 
parametrized, 9 
piecewise smooth, 112 
regular, 9 
cuspidal cubic 
arc length, 15 
cylinder 
mean and Gaussian curvature, 40 
shape operator, 40 


decomposable 
in the exterior algebra, 164 
in the tensor product, 153 
degree, 147 
of a form, 188 
of a line bundle, 239 
diagonal entries 
of a skew-symmetric matrix is 0, 224 
diagonalizable matrices 
are dense in gl(r,C), 312 
diffeomorphism, 295 
differential, 296 
differential form, 297, 298 
vector-valued, 187 
differential forms 
depending smoothly on a real parameter, 
216 
with values in a vector bundle, 194 
with values in a vector space, 186 
differentiating under an integral sign), 218 
dimension 
of tensor product, 157 
direct sum 
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of vector bundles, 181 
directional derivative 
computation using a curve, 22 
in R”, 22 
has zero curvature, 25 
is compatible with the metric, 25 
is torsion-free, 25 
of a vector field, 22 
of a vector-valued form, 190 
on a submanifold of R”, 27 
properties, 23 
distance 
on a connected Riemannian manifold, 129 
distribution, 254 
horizontal, 251 
dot product, 2 
dual 
of a module, 157 
dual 1-forms 
under a change of frame, 132 
dual 1-forms, 84 


Ehresmann connection, 242, 256 
Einstein summation convention, 80 
elementary symmetric polynomials, 311 
ellipse 
curvature, 14 
equivariant map, 243 
Euclid’s fifth postulate, 110 
Euclid’s parallel postulate, 110 
Euclidean connection, 43 
Euclidean inner product, 2 
Euclidean metric, 5 
Euler characteristic 
independent of decomposition, 146 
of a compact orientable odd-dimensional 
manifold is 0, 200 
of a polygonal decomposition of a surface, 
145 
Euler class, 233 
existence of a geodesic, 107 
existence of a connection, 73 
existence of a Hermitian metric, 234 
existence of geodesics, 106 
exponential map 
of a connection, 115 
exponential map 
as a natural transformation, 126 
differential, 117 
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for a Lie group, 120, 122 
naturality, 116 
extension of algebraic identities, 307 
exterior derivative, 299 
exterior differentiation 
on R3, 302 
exterior algebra, 164 
exterior derivative 
of a vector-valued 1-form, 198 
of a vector-valued form, 190 
properties, 300 
exterior power, 164 
basis, 168 


F-bilinearity, 59 
F-linear map 


of sections correspond to a bundle map, 


58 

fiber, 242 

of a vector bundle, 49 
fiber bundle, 242 
first fundamental form 

coefficients, 35 
first Bianchi identity, 203 

vector form, 204 
first fundamental form, 35 
first fundamental form, 68 
first structural equation, 85 
flat section, 72 
form 

smooth, 188, 298 
forms with values in a Lie algebra, 191 
frame, 56, 174 

k-forms, 298 

of vector fields, 296 

positively oriented, 229 
frame bundle, 247 

of a vector bundle, 247 
frame manifold 

of a vector space, 246 
framed open set, 81 
free action, 243 
free module, 299 

rank, 299 
Frenet—Serret formulas, 15 
Frenet—Serret frame, 15 
functoriality 

of tensor product, 160 


fundamental theorem on symmetric 
polynomials, 312 
fundamental vector field, 247 
integral curve, 249 
right-equivariance, 248 
vanishing at a point, 249 
Fundamental vector fields 
Lie bracket of, 252 


G-manifold, 243 
Gauss curvature equation 

in terms of forms, 89 
Gauss curvature equation, 62 
Gauss map, 39, 40, 147 
Gauss’s Theorema Egregium, 19, 63 
Gauss—Bonnet formula 

for a polygon, 143 
Gauss—Bonnet theorem, 20 

for a surface, 145 

generalized, 233 
Gaussian curvature, 18, 67 

and Gauss map, 147 

in terms of an arbitrary basis, 64 


is the determinant of the shape operator, 


34 
of a cylinder, 40 
of a Riemannian 2-manifold, 91 
of a sphere, 40 
of a surface, 90 
of a surface of revolution, 41 
of the Poincaré disk, 114 
Poincaré half-plane, 94 
positive, 146 
generalized Gauss—Bonnet theorem, 233 
generalized second Bianchi identity, 204 
on a frame bundle, 274 
genus 
of a compact orientable surface, 146 
geodesic, 95, 103 
existence of, 106 
existence of, 107 
in the Poincaré half-plane, 108 
maximal, 103 
minimal, 130 
on a sphere, 104 
reparametrization, 105 
speed is constant, 103 
geodesic equations, 107 
geodesic curvature, 138 


signed geodesic curvature, 140 

total geodesic curvature, 141 
geodesic equations, 106 

of the Poincaré disk, 114 
geodesic polygon, 144 
geodesic triangle 

sum of angles, 144 
geodesically complete, 130 
geodesics 

on a sphere, 107 
geometric curve, 9 
germ 

of a function at a point, 295 
germ of neighborhoods, 127 
graded algebra, 299 
gradient vector field, 137 
Gram-—Schmidt process, 81 
graph 

curvature, 14 


helicoid, 37 

Hermitian bundle, 234 

Hermitian inner product, 234 

Hermitian metric, 234 
existence of, 234 

Hermitian symmetric, 234 


Hirzebruch—Riemann-Roch theorem, 239 


holomorphic tangent bundle, 236 
holomorphic vector bundle, 238 
holonomy, 113 
homogeneous elements, 163 
homogeneous form, 188 
homogeneous manifold, 244 
Hopf bundle, 244 
Hopf Umlaufsatz, 142 
Hopf-Rinow theorem, 131 
horizontal component 

of a form, 281 

of a tangent vector, 256 
horizontal distribution, 251, 254 

of an Ehresmann connection, 257 
horizontal form, 277 
horizontal lift, 262, 263 

of a vector field, 259, 266 
horizontal lift formula, 266 
horizontal lift of a vector field 

to a frame bundle, 266 

to a principal bundle, 259 
horizontal tangent vector, 264 
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horizontal vector, 262 
Horizontal vector fields 
Lie bracket of, 274 
hyperbolic plane, 114, 144 
hyperbolic triangle, 144 
hypersurface, 29, 66 
normal vector field, 29 
volume form, 137 


immersed submanifold, 304 
immersion, 304 
induced connection 

on a pullback bundle, 210 
inner product, 3 

Euclidean, 2 
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representation by a symmetric matrix, 3 


restriction to a subspace, 3 
inner products 
nonnegative linear combination, 3 
integral curve 
of a fundamental vector field, 249 
integral form, 226 
interior angle, 142 
interior angles 
of a polygon, 147 
invariant, 212 
invariant complex polynomials, 310 
invariant form, 280 
invariant polynomial, 212, 306, 308 
generators, 215 
isometric invariant, 63 
isometry, 1, 5 
isomorphic vector bundles, 51 


jump angle, 142 


L-polynomials, 314 
left action, 243 
left G-equivariant map, 243 
left-invariant metric, 124 
left-invariant vector field, 119 
Leibniz rule, 43 
length, 4 

of a vector, 2 
length of a vector, 128 
Levi-Civita connection, 45 
Lie bracket 


of a vertical and a horizontal vector field, 


260 


342 Index 


Lie bracket, 297, 305 
of fundamental vector fields, 252 
of horizontal vector fields, 274 
of vector fields, 28 
Lie derivative 
comparison with the directional derivative 
in R", 25 
Lie group 
exponential map, 122 
lift, 263 
horizontal, 263 
line bundle, 49, 235 
trivial, 235 
local operator, 53, 76 
restriction, 55 
local trivialization, 242 
locally Euclidean, 293 
locally finite, 6 
locally path-connected, 305 
locally trivial, 242 


Maurer—Cartan connection, 260 

curvature of, 274 
Maurer—Cartan equation, 198 
Maurer—Cartan form, 198 

right translate, 198 
maximal atlas, 293 
maximal geodesic, 103 
mean curvature, 18, 41, 67 

of a cylinder, 40 

of a sphere, 40 

of a surface of revolution, 41 
metric 

on the Poincaré half-plane, 93 
metric connection, 75, 234 

existence of, 76 

relative to an orthonormal frame is 

skew-symmetric, 83 

metric space, 129 
metric-preserving map, 5 
minimal geodesic, 130 
MObius strip, 50 
module 

free, 299 
morphism 

of principal bundles, 244 
multi-index, 298 

ascending, 298 

strictly ascending, 298 


natural transformation, 126 
naturality 
of the exponential map, 116 
of characteristic classes, 221 
of the exponential map for a Lie group, 
122 
naturality property, 221 
Newton’s identities, 317 
Newton’s identity, 215 
non-Euclidean geometry, 110 
nondegenerate pairing, 169 
normal coordinates, 118 
normal curvature 
average value of, 41 
of a normal section, 18 
normal neighborhood, 118 
normal section, 18 
normal vector, 17 
normal vector field, 17 
along a hypersurface, 29 
smooth, 17 


orbit, 243 
orientable vector bundle, 229 
orientation 
and curvature, 13 
on a vector bundle, 229 
on a vector space, 228 
orientation-preserving reparametrization, 
128 
orientation-reversing reparametrization, 128 
oriented vector bundle, 229 
orthogonal projection, 81 
on a surface in R?, 46 


pairing 
nondegenerate, 169 
of two modules, 169 
parallel translation 
existence of, 111 
parallel along a curve, 263 
parallel frame 
along a curve, 263 
parallel postulate, 110 
parallel translate, 111 
parallel translation, 111, 263 
on a sphere, 113 
preserves length and inner product, 112 
parallel transport, 111, 263 


parallel vector field, 110 
parametrization 

by arc length, 10 
parametrized curve, 9 
partition of unity, 6 
path component, 305 
permutation matrix, 311 
Pfaffian, 231 
piecewise smooth curve, 112 
Poincaré disk 

connection and curvature forms, 86 
Poincaré half-plane 

Gaussian curvature, 94 

metric, 93 
Poincare 

Poincaré disk 

Christoffel symbols, 102 

Poincaré disk 

Gaussian curvature, 94 
Poincaré half-plane, 144 
Poincaré half plane 

Gaussian curvature, 92 
Poincaré half-plane 

geodesics, 108 
point operator, 53 
point-derivation, 295 
polygon 

geodesic polygon, 144 

on a surface, 142 
polynomial, 212, 306 

Ad(G)-invariant, 287 

on gl(r, F), 308 

on a vector space, 287 
polynomial function, 306 
polynomial on so(r), 230 
polynomials 

versus polynomial functions, 306 
Pontrjagin class, 225 
Pontrjagin number, 226, 236 
positive orientation 

on a polygon, 142 
positive-definite, 234 
positive-definite bilinear form, 3 
positive-definite symmetric matrix, 4 
positively oriented frame, 229 
principal curvature 

is an eigenvalue of the shape operator, 33 
principal bundle, 244 
principal curvature, 18, 67 
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principal direction, 18 

is an eigenvector of the shape operator, 33 
principle of extension of algebraic identities, 

307 

product bundle, 50, 244 
product of vector-valued forms, 188 
projection 

orthogonal, 81 
pseudo-tensorial form 

with respect to a representation, 277 
pullback 

of a vector bundle, 177 

of a differential form, 303 

of a function, 303 
pullback of vector-valued forms, 193 
pullback bundle, 177 

induced connection, 210 

examples, 180 
pushforward 

of of a vector field, 98 


quotient bundle, 177 


rank, 299 

rank of a vector bundle, 49 

real invariant polynomials, 315 
generated by the coefficients of 

characteristic polynomial, 317 

generation, 319 

regular curve, 9 

regular point, 29 

regular submanifold, 5, 17 

regular value, 29 

reparametrization, 9, 128 
orientation-preserving, 128 
orientation-reversing, 128 

reparametrization of a geodesic, 105 

restriction 
of a connection to an open subset, 76 
of a form to a submanifold, 304 
of a local operator, 55 
of a vector bundle to an open set, 178 
of a vector bundle, 50 

retraction, 252 

Ricci curvature, 208 

Riemann curvature tensor, 207 

Riemannian bundle, 74 

Riemannian connection, 45 
existence and uniqueness, 45 
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in terms of differential forms, 85 
on a surface in R3, 47 
Riemannian manifold, 4 
is a metric space, 129 
Riemannian metric, 4 
existence, 6 
on a manifold with boundary, 133 
on a vector bundle, 74 
right action, 243 
right G-equivariant map, 243 
right-equivariant form 
with respect to a representation, 277 
rotation angle theorem, 143 
rotation index theorem, 143 


scalar curvature, 209 
second Bianchi identity 
in vector form, 207 
second Bianchi identity, 204, 271 
generalized, 204 
generalized, on a frame bundle, 274 
second fundamental form, 35, 68 
coefficients, 36 
second structural equation, 85 
section 
of a vector bundle, 51 
sectional curvature, 92 
sections 
of a vector bundle along a curve, 262 
sesquilinear, 234 
shape operator, 30, 66 
and curvature, 32 
is self-adjoint, 31 
matrix is symmetric, 32 
of a cylinder, 40 
of a sphere, 40 
of a surface of revolution, 41 
short exact sequence 
rank condition, 252 
splitting, 252 
shuffle, 172, 297 
sign convention, 197 
signature, 238 
signed curvature, 12 
signed geodesic curvature, 140 
signs concerning vector-valued forms, 197 
singular value, 29 
skew-symmetric matrices 
powers, 224 


skew-symmetric matrix 
diagonal 0, 224 
smooth dependence 
of a form on f, 216 
smooth form, 188, 298 
smooth function, 293, 294 
smooth manifold, 294 
smooth map, 295 
smooth vector field, 296 
smoothly varying 
family of forms, 216 
Ad (SO(r))-invariant polynomial, 
229 
speed, 10, 103 
sphere 
geodesics, 107 
geodesics on, 104 
mean and Gaussian curvature, 40 
shape operator, 40 
volume form in Cartesian coordinates, 
135 
spherical coordinates, 135 
splitting, 251, 252 
stabilizer, 243 
straight, 95 
strictly ascending multi-index, 298 
structural equation 
first, 85 
second, 80, 85 
subbundle, 174 
subbundle criterion, 175 
submanifold 
immersed, 304 
regular, 5 
summation convention 
Einstein, 80 
support 
of a function, 6 
surface 
in R3,5 
surface in R? 
covariant derivative, 104 
Gaussian curvature, 90 
surface of revolution 
Christoffel symbols, 102 
mean and Gaussian curvature, 41 
shape operator, 41 
symmetric bilinear form, 3 
symmetric connection, 100 


symmetric polynomials 
fundamental theorem, 312 
generation, 318 
symmetric power, 173 
symmetries 
of curvature, 205 


tangent bundle, 51 
tangent space, 295 
tangent vector, 295 
tensor, 91 
tensor product 
basis, 156 
tensor algebra, 162, 163 
tensor criterior, 196 
tensor field 
covariant derivative of, 206 
tensor field on a manifold, 195 
tensor product, 152 
associativity, 161 
basis, 156 
characterization, 154 
dimension, 157 
functorial properties, 160 
identities, 158 
of finite cyclic groups, 159, 163 
of three vector spaces, 161 
tensorial form 
of type p, 277 
Theorema Egregium, 19, 63 
using forms, 90 
third fundamental form, 68 
topological manifold, 294 
torsion, 44 
is F-linear, 44 
of directional derivative, 24 
torsion forms, 84 
torsion-free, 45 
in terms of Christoffel symbols, 
100 
torsion-free connection, 100 
torus 
as a Riemannian manifold, 5 
total curvature, 40, 146 
is a topological invariant, 146 
of a plane curve, 147 
total geodesic curvature, 141 
total Pontrjagin class, 226 
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total space, 242 

of a vector bundle, 49 
trace 

derivative of, 215 

of a bilinear form, 209 
trace polynomial, 213, 215, 306, 310 
transition functions, 245 
transitive action on a fiber, 244 
transposition matrix, 311 
triangle 

sum of angles, 144 
trivial line bundle, 235 
trivialization, 49, 72 
trivializing open cover, 50 
trivializing open set, 49 


Umlaufsatz, 147 
unit-speed polygon, 142 
universal mapping property 
for bilinear maps, 154 
universal mapping property 
for alternating k-linear maps, 166 
for bilinear maps, 153 
of the tensor product, 153 


vanishing 
of characteristic classes, 223 
vector bundle 
associated to a representation, 275 
vector bundle, 49 
base space, 49 
holomorphic, 238 
isomorphism, 51 
total space, 49 
vector field, 296 
along a curve, 25 
along a submanifold, 26 
left-invariant, 119 
on a submanifold, 26 
parallel, 110 
vector subbundle, 174 
vector-valued forms 
product, 188 
vector-valued k-covector, 186 
vector-valued differential forms, 186 
vector-valued form, 186, 187 
directional derivative, 190 
pullback, 193 
velocity vector field, 25 
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vertical component, 254 on H?, 133 
of a tangent vector, 256 on R?2, 133 
vertical subbundle, 251 on the boundary, 133 
vertical tangent space, 250 
vertical tangent vector, 250 wedge product, 164, 298 
volume form is anticommutative, 165 
of a smooth hypersurface, 137 properties, 164 
of a sphere, 135 under a change of frame, 132 
of a sphere in spherical coordinates, 135 wedge product formula, 172 
of an oriented Riemannian manifold, 133 Weingarten map, 30 


on a circle, 134 Whitney product formula, 226 
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